Language Selection

English French German Italian Portuguese Spanish

Linux Desktop Search Engines Compared

Filed under
Software

I have a large electronic library (over 15,000 books) and I was looking for a way to cope with this mass of information. I didn't like the idea of a special catalog, since it would take a lot of manual work to enter the metadata. Besides, my books are in various formats, from HTML, to RTF, to DOC, to PDF, to DjVU. These files lack metadata way too often and I thought a local indexing service with a full-text search might solve my problem. I knew there are more options to choose from than just Google, but I could not find a good modern comparison. Even the table in Wikinfo's Comparison of desktop search software contained too many errors, as I discovered.

I had to compare them myself.

My task imposed certain restrictions on the one hand, but made the others irrelevant on the other hand. So, I was especially interested in a wide gamut of file types, in the ability to add new ones (Epub, fb2, html.zip) and in extensive query language. All software, except for GDS and DocFetcher, was installed from Ubuntu 9.10 repositories.

I have no special preferences regarding the backend, it may be Xapian- or Lucene-based tool, or even a custom backend. On the other hand, Xapian usually requires more disk space, and there is never too much space on desktops.

Rest Here




More in Tux Machines

India yet to catch up with FOSS, says Rushabh Mehta of ERPNext

We got a chance to interact with Rushabh Mehta, the founder of Web Notes Technologies, a company based in Mumbai, India. ERPNext is the major product of the company. It is a free and Open Source web based ERP (Enterprise Resource Planning) solution for small and medium sized businesses with its presence in more than 60 countries. In addition to the regular discussions on their Open Source product, strategy, customers etc. we also got a chance to understand how hard it is to thrive in an environment where the “Open Source” philosophy is not a familiar term yet. A software developer by passion and an Industrial Engineer by training, Rushabh also informed us about their imminent product conference in Mumbai he is quite excited about. Read more

Today in Techrights

Mesa 10.3 released

Mesa 10.3 has been released! Mesa 10.3 is a feature release that includes many updates and enhancements. The full list is available in the release notes file in docs/relnotes/10.3.html. The tag in the GIT repository for Mesa 10.3 is 'mesa-10.3'. I have verified that the tag is in the correct place in the tree. Mesa 10.3 is available for download at ftp://freedesktop.org/pub/mesa/10.3/ Read more

Tizen Development Units now available!

The Linux Foundation have today announced the next round of the Tizen development unit program is now available, with the Intel NUC and Samsung RD-PQ hardware devices being available. The Idea behind this program is to put the required hardware in developers hands so they can develop and test their applications on real hardware. It has to be noted that the Samsung RD-PQ device does not have GSM connectivity, and therefore can not be used as a real world device, which is a pity as developers do need real devices so late in the game. Read more