Welcome to the K4/Clarissa blog, which explains the intentions of our project and gives an overview of work in progress. At the end of the blog we have added posts and articles from our predecessor project DSBENCH, a database benchmark tool.
Recent Posts
Rewrite Testing
Most of the effort in 2024, now drawing to a close, went into cleaning up code and test data and into substantially enhancing the test environment and documentation, as part of finally opening the project to the public.
To make clear that this is a time-consuming job, keep in mind that:
- currently there is only one developer: me
- the project has existed since 2015, and some precursor projects such as DSBENCH are even older
- the project currently contains about:
  - 308k lines of Python code
  - 38k lines of Pascal code (most of which was eventually dropped)
  - 84k lines of procedural SQL code
  - 23k lines of documentation
  - about 685 MB of (partially self-generated) test data

As a result of rewriting the test environment, it is now possible to run a self-test of Clarissa-NET's capabilities.
Intended K4/Clarissa Presentation at Linux Tag 2024 in Chemnitz
Initially, I planned to present the current state of K4/Clarissa at Chemnitzer Linux Tage 2024, but unfortunately the submission was rejected. Please feel free to read the abstract and the short paper originally submitted there. In addition, I have added a short article in German that describes what I intended to show.
A Call to Support our Project
By now, K4/Clarissa contains about a decade of work by two people. We have developed many concepts and tested database engines (see our DSBENCH project). As the software grows, it becomes more and more clear that we need supporters who can contribute to the many areas that are still under construction, for instance:
GUI conception and design: more eye-candy widgets, matching color palettes, GUI concept and workflow optimization. Although Clarissa is already able to “speak” English and German, all dialogs would benefit from a revision.
Why not available for Download?
Some of you may ask why our software, although it is intended to be open source, is not yet available for download.
The reason is quite simple:
We place high demands on the quality of a piece of software before we publish it. Most of the software is written in Python3. We have already made some tests with cx_Freeze to create a “compiled” release, but there is still a lot to do to get reliable results.
Clarissa - an Introduction
Clarissa, the GUI of the K4 project, includes built-in help and documentation. I have compiled the introduction and the index of that documentation in its current state (in German).
Clarissa Net in a Nutshell
Clarissa Net is simply a network of K4/Clarissa users who are able to communicate with each other. Note that communication is not limited to K4/Clarissa users, but its scope is vastly extended if both sides are using K4/Clarissa.
K4/Clarissa includes several systems for communicating with others. We call such systems transports; they are based on well-known services and protocols like Telegram (https://telegram.org), Matrix (https://matrix.org), XMPP (https://xmpp.
State Report 11/2023
After a long period of silence on this site, it is time to give at least a short update on the progress of our K4 project. Contrary to the impression our web presence may have given, the project is still active and has undergone some fundamental changes. The following notes give only a short and incomplete overview of what happened between 2019 and the end of 2023:
Our original intention to implement the GUI of K4 (which is also called Clarissa) in Object Pascal turned out to be too limiting, so we decided to completely redesign it in Python3 + Qt5.
CLT 2019 Presentation for Download
Our presentation at Chemnitzer Linux Tage 2019 has now taken place. Please feel free to download the presentation or to read a detailed write-up (both in German).
K4 Presentation at Linux Tag 2019 in Chemnitz
We plan to present the current state of K4 for the first time at Chemnitzer Linux Tage 2019. Our presentation will be in German. Please feel free to read the abstract and a short paper.
Example: ENRON
In order to test our e-mail import module and the word indexer, we took the ENRON e-mail corpus and loaded it into our database.
ENRON was one of the biggest US energy companies, headquartered in Houston, Texas. In 2001 ENRON caused a major scandal through sustained accounting fraud. During the criminal investigation the company's e-mails were confiscated. These e-mails have since been released for computer science and programming purposes.
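As a rough illustration of what a word indexer over imported e-mails does, the following minimal sketch builds an inverted index that maps each word to the set of messages containing it. It is not the K4 implementation; the names `build_index` and `search`, the tokenization rule and the sample texts are assumptions made up for this example.

```python
import re
from collections import defaultdict

# Assumption: a "word" is a run of ASCII letters or digits, compared case-insensitively.
WORD_RE = re.compile(r"[a-z0-9]+")


def build_index(documents):
    """Map every word to the set of document ids in which it occurs."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in WORD_RE.findall(text.lower()):
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain all words of the query."""
    words = WORD_RE.findall(query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical sample messages, not taken from the ENRON corpus.
mails = {
    1: "Energy trading report for Houston",
    2: "Lunch in Houston?",
    3: "Quarterly trading summary",
}
index = build_index(mails)
print(search(index, "trading houston"))
```

A real indexer for a corpus of this size would keep the index in the database rather than in memory; the sketch only shows the underlying idea.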
Handling E-Mail
Over the past few years, communication between people has changed substantially through the use of smartphones and messengers like WhatsApp and Telegram. Nevertheless, e-mail is still one of the most important ways of exchanging digital data and information. This is even more true for companies and authorities.
Hence, e-mail is a key element in the management of personal and private data. From a technical perspective, a single e-mail contains not only the communication content but also a lot of useful information in the header, for instance the time of receipt and the addresses and names of the communication partners.
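To make the header information mentioned above more concrete, here is a small sketch that uses Python's standard `email` package to pull the date, the sender and the recipients out of a raw message. The sample message and the function name `extract_header_info` are assumptions for illustration and do not reflect K4's actual import module. Note that the `Date` header holds the sending time; the time of receipt would have to be taken from the `Received` headers added by mail servers.

```python
from email import message_from_string
from email.utils import getaddresses, parsedate_to_datetime

# Hypothetical sample message used only for this example.
RAW_MESSAGE = """\
From: Alice Example <alice@example.org>
To: Bob Example <bob@example.org>, carol@example.org
Date: Mon, 14 May 2001 09:30:00 -0500
Subject: Meeting notes

See you at ten.
"""


def extract_header_info(raw):
    """Return date, sender, recipients and subject of a raw e-mail."""
    msg = message_from_string(raw)
    # getaddresses() yields (display name, address) pairs.
    sender = getaddresses(msg.get_all("From", []))
    recipients = getaddresses(msg.get_all("To", []) + msg.get_all("Cc", []))
    return {
        "sent": parsedate_to_datetime(msg["Date"]),
        "sender": sender[0] if sender else None,
        "recipients": recipients,
        "subject": msg["Subject"],
    }


info = extract_header_info(RAW_MESSAGE)
print(info["sender"], info["sent"].isoformat())
```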
The Pillars of K4 - An Overview
Applications that collect searchable data (basically every messenger) usually use an SQLite database. SQLite is a very simple relational SQL database without any procedural programming capabilities. Data-centric applications, even those that use far more advanced database products, often treat a database as a simple storage container from which data can be retrieved very quickly. If no attempt is made to use relational features, the database is often reduced to a centralized data store, used by some orthogonal persistence layer that is fed data by some middleware.
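The difference between a plain data store and a database whose relational features are actually used can be shown with a small sketch. It uses Python's standard `sqlite3` module with an in-memory database; the tables `contact` and `message` and their contents are invented for this example and are not part of the K4 schema. Instead of fetching all rows and counting in application code, the join and the aggregation are left to the database.

```python
import sqlite3

# Assumption: a toy schema with contacts and the messages they sent.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE contact (
        id   INTEGER PRIMARY KEY,
        name TEXT NOT NULL
    );
    CREATE TABLE message (
        id        INTEGER PRIMARY KEY,
        sender_id INTEGER NOT NULL REFERENCES contact(id),
        body      TEXT NOT NULL
    );
    INSERT INTO contact (id, name) VALUES (1, 'Alice'), (2, 'Bob');
    INSERT INTO message (sender_id, body)
    VALUES (1, 'Hello'), (1, 'Are you there?'), (2, 'Yes');
    """
)

# The database joins both tables and counts the messages per contact.
rows = conn.execute(
    """
    SELECT c.name, COUNT(m.id) AS message_count
    FROM contact AS c
    LEFT JOIN message AS m ON m.sender_id = c.id
    GROUP BY c.id, c.name
    ORDER BY c.name
    """
).fetchall()

for name, message_count in rows:
    print(name, message_count)
```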