Language Selection

English French German Italian Portuguese Spanish

The Road to KDE 4: Strigi and File Information Extraction

Filed under
KDE

After a short delay due to a heavy dosage of Real Life(tm), I return to bring you more on the technologies behind KDE 4. This week I am featuring Strigi, an information extraction subsystem that is being fully deployed for KDE 4.0. KDE has previously had the ability to extract information about files of various types, and has used them in a variety of functional contexts, such as the Properties Dialog. Strigi promises many improvements over the existing versions. Read on for more...

Strigi is a library that sits at a lower level than KDE. It is written in C++, and is designed to present a series of generic calls that a program can use to find more information about a given file or files. It is in no way tied to KDE except that the development version lives in KDE's SVN repository. It also has search capabilities, which are not really the focus of this article.

The Strigi libraries are used to get information from within files, such as the dimensions of an image, or the length of an audio clip, embedded thumbnails, number of lines in a log, source code licensing info or just to search a text file for a given string. Strigi has other advantages, as it can work inside compressed files, archives, and so forth seamlessly. In fact, it ships a few useful utility programs, called deepgrep and deepfind. These useful command line programs allow you to search for information within binary file formats as easily as using grep or find on plain text files. KDE is inheriting the same libraries, so we also get this unique advantage of being able to pull information out of files that are buried within binary formats, such as .tgz files.

Full Story.

More in Tux Machines

The Importance of BSD

The Berkeley Software Distribution (BSD) is a Unix operating system developed by the Computer Systems Research Group (CSRG) of the University of California, Berkeley. Read more

Ubuntu 16.10 Unity and Ubuntu MATE

  • Ubuntu 16.10 Unity 8 / Mir - Current State
  • Bytemark sponsor Ubuntu MATE
    A couple of weeks ago the Bytemark Managing Director, Matthew Bloch, contacted the Ubuntu MATE team to offer free hosting for the project. As of August 18th 2016 all the Ubuntu MATE infrastucture is hosted on Bytemark Cloud Servers.
  • Ubuntu MATE 16.10 Beta 1
    We are underwhelmed to announce, quite possibly, our most uninteresting beta release E-V-E-R! ;-) This beta release is all about the plumbing that transitions Ubuntu MATE to GTK 3.20. It really isn’t very interesting from an end-users perspective.

Linux Foundation and Linux Birthday

LWN at GUADEC

  • Flowgraphs in GTK+
    At GUADEC 2016 in Karlsruhe, Germany, Daniel "grindhold" Brendle presented his work developing a new library and widget set that will allow GTK+ applications to implement flowgraphs in a standard manner. The widget set would enable applications to provide interactive widgets for linking filters and other block-oriented components—a type of interface many applications currently need to reinvent on their own. Flowgraphs, Brendle explained, are a general-purpose diagramming technique that many people will recognize from textbooks and other printed matter. They show how objects, information, and signals flow through some sort of process. Biology textbooks use them to illustrate circulation in the body, technical manuals use them to show how a manufacturing process runs, and so on. In software, he said, they are most familiar as the node-and-pipe diagrams that illustrate signal processing or data filtering.
  • The GNOME Newcomers initiative
    At GUADEC 2016 in Karlsruhe, Germany, Bastien Ilsø and Carlos Soriano reported on the revamped Newcomers section of the GNOME web site. The section is intended to draw in new users and developers and help them find their way around the project as well as to help them get the necessary development environment set up to begin contributing code.