The Road to KDE 4: Strigi and File Information Extraction

After a short delay due to a heavy dose of Real Life(tm), I return to bring you more on the technologies behind KDE 4. This week I am featuring Strigi, an information extraction subsystem that is being fully deployed for KDE 4.0. KDE has previously had the ability to extract information about files of various types, and has used that information in a variety of contexts, such as the Properties Dialog. Strigi promises many improvements over the existing implementation. Read on for more...

Strigi is a library that sits at a lower level than KDE. It is written in C++, and is designed to present a series of generic calls that a program can use to find more information about a given file or files. It is in no way tied to KDE except that the development version lives in KDE's SVN repository. It also has search capabilities, which are not really the focus of this article.

The Strigi libraries are used to get information from within files: the dimensions of an image, the length of an audio clip, embedded thumbnails, the number of lines in a log, source code licensing info, or simply whether a text file contains a given string. Strigi has another advantage, as it works seamlessly inside compressed files, archives, and so forth. In fact, it ships utility programs called deepgrep and deepfind. These command line programs allow you to search for information within binary file formats as easily as using grep or find on plain text files. KDE is inheriting the same libraries, so we also get this unique advantage of being able to pull information out of files that are buried within binary formats, such as .tgz files.
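To make the idea of searching inside a .tgz file concrete, here is a minimal sketch in Python. It is not Strigi's code or API, and it does not reproduce how deepgrep works internally; it only illustrates the general technique of streaming through archive members and matching text without unpacking anything to disk. The function names `deep_grep` and `make_sample_tgz` are made up for this example, and the sketch handles a single archive level only, with no nested archives.

```python
import io
import tarfile


def deep_grep(archive_bytes: bytes, needle: str) -> list:
    """Return (member name, line number, line) for each line containing needle.

    Illustrative only: reads a (possibly compressed) tar archive from memory
    and scans every regular file in it as UTF-8 text.
    """
    hits = []
    # mode "r:*" lets tarfile detect gzip/bzip2/xz compression on its own.
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r:*") as archive:
        for member in archive.getmembers():
            if not member.isfile():
                continue  # skip directories, links, devices
            stream = archive.extractfile(member)
            if stream is None:
                continue
            for number, raw in enumerate(stream, start=1):
                line = raw.decode("utf-8", errors="replace").rstrip("\n")
                if needle in line:
                    hits.append((member.name, number, line))
    return hits


def make_sample_tgz() -> bytes:
    """Build a small in-memory .tgz with two text files for the demo."""
    files = [
        ("notes/a.txt", "hello\nKDE 4 is coming\n"),
        ("notes/b.txt", "nothing here\n"),
    ]
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name, text in files:
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            archive.addfile(info, io.BytesIO(data))
    return buffer.getvalue()


if __name__ == "__main__":
    for name, number, line in deep_grep(make_sample_tgz(), "KDE"):
        print(f"{name}:{number}: {line}")
```

Run as a script, this prints the one matching line along with the archive member it came from, much as grep reports a file name and line. A real tool would also need to recognise many more container and binary formats, which is the part Strigi takes care of.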
