Language Selection

English French German Italian Portuguese Spanish

The Road to KDE 4: Strigi and File Information Extraction

Filed under

After a short delay due to a heavy dosage of Real Life(tm), I return to bring you more on the technologies behind KDE 4. This week I am featuring Strigi, an information extraction subsystem that is being fully deployed for KDE 4.0. KDE has previously had the ability to extract information about files of various types, and has used them in a variety of functional contexts, such as the Properties Dialog. Strigi promises many improvements over the existing versions. Read on for more...

Strigi is a library that sits at a lower level than KDE. It is written in C++, and is designed to present a series of generic calls that a program can use to find more information about a given file or files. It is in no way tied to KDE except that the development version lives in KDE's SVN repository. It also has search capabilities, which are not really the focus of this article.

The Strigi libraries are used to get information from within files, such as the dimensions of an image, or the length of an audio clip, embedded thumbnails, number of lines in a log, source code licensing info or just to search a text file for a given string. Strigi has other advantages, as it can work inside compressed files, archives, and so forth seamlessly. In fact, it ships a few useful utility programs, called deepgrep and deepfind. These useful command line programs allow you to search for information within binary file formats as easily as using grep or find on plain text files. KDE is inheriting the same libraries, so we also get this unique advantage of being able to pull information out of files that are buried within binary formats, such as .tgz files.

Full Story.

More in Tux Machines

QNAP and Canonical Optimize Ubuntu For IoT Purposes

The Internet Of Things movement has attracted a lot of attention in recent years. Not just enthusiasts, but also major firms in the technology sector are working on developing new IoT initiatives. A Partnership between QNAP and Canonical will help optimize Ubuntu on NAS systems for Internet of Things applications. Read more

Mozilla turns Firefox OS into IoT hub

As an operating system, Firefox OS has undergone a massive transformation in the past 24 months – it’s far more than just a web browser nowadays. But now Mozilla is looking to take Firefox to the next level by using it as a hub for a plethora of Internet of Things projects. Mozilla is currently working on four IoT projects behind the scenes: Project Smart Home, Project Link, Project Sensor Web and Vaani. Each of the projects will deal with IoT technology in different ways, but all are aimed at making the end consumer’s home and devices smarter. In a blog post, Mozilla’s SVP of Connected Devices, Ari Jaaksi, posted: “Everything is connected around us. This revolution has already started and it will be bigger than previous technology revolutions, including the mobile smartphone revolution. Internet of Things, as many call it today, will fundamentally affect all of us.” Read more

Kicking the Tires on Arch Based Antergos

We decided to take the Arch Linux based distribution Antergos out for a test drive. Here’s how it handled, out in traffic and on the track. A few months back, a fellow tech writer mentioned in an email exchange that I might try using the Arch Linux based Antergos distro as a way to grab the latest and greatest versions of popular software titles for review. Mainly because of Arch’s community repositories, in which users suggest and vote on packages to be included, many popular software titles are available within days after a new release. And since Antergos is a simple install compared to Arch, it would be easy to quickly throw up an installation on a test machine just to look at the latest and greatest from LibreOffice, GIMP and the like. Read more

Red Hat and Fedora