Linux Desktop Search Engines Compared


I have a large electronic library (over 15,000 books) and I was looking for a way to cope with this mass of information. I didn't like the idea of a dedicated catalog, since entering the metadata would take a lot of manual work. Besides, my books are in various formats: HTML, RTF, DOC, PDF, and DjVu. These files lack metadata far too often, so I thought a local indexing service with full-text search might solve my problem. I knew there were more options to choose from than just Google, but I could not find a good, up-to-date comparison. Even the table in Wikinfo's Comparison of desktop search software contained too many errors, as I discovered.

I had to compare them myself.

My task imposed certain requirements and made other criteria irrelevant. In particular, I was interested in support for a wide range of file types, in the ability to add new ones (EPUB, fb2, html.zip), and in an extensive query language. All software, except for GDS and DocFetcher, was installed from the Ubuntu 9.10 repositories.

I have no special preferences regarding the backend: it may be a Xapian- or Lucene-based tool, or even one with a custom backend. On the other hand, Xapian usually requires more disk space, and there is never too much space on a desktop.
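Xapian and Lucene are both built around an inverted index, a structure that maps each word to the documents containing it. The following is a minimal sketch of that idea in plain Python, not the way any of the compared tools is actually implemented. The file names and texts are hypothetical, and the sketch assumes the text has already been extracted from each file, which is a separate step for formats such as PDF or DjVu.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each token to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in docs.items():
        for token in tokenize(text):
            index[token].add(name)
    return index


def search(index, query):
    """Return the documents that contain every token of the query (AND)."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    # Copy the first posting set so the index itself is never modified.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample library (assumption: text already extracted).
docs = {
    "kernel.pdf": "Understanding the Linux kernel",
    "shell.html": "Linux shell scripting basics",
    "cooking.rtf": "Basics of French cooking",
}

index = build_index(docs)
print(sorted(search(index, "linux")))
print(sorted(search(index, "linux basics")))
```

The index stores an entry for every distinct word in every document, which is why a backend's on-disk format has a direct effect on how much space a large library consumes.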




