PhpDig excels at small Web site indexing

Webmasters looking to provide search capabilities for their site would do well to try out PhpDig, a Web spider and search engine written in PHP with a MySQL backend. There are other open source search engines, all of which have their own advantages. PhpDig just happens to suit the needs of my Information Technology for Greenhouses and Horticulture site. Here's how I got it working.

Webmasters with small sites know the problem of providing useful site search capabilities. Typically, visitors enter keywords in a search box and the search engine returns a ranked list of pages related to the query. This is a useful service -- provided the visitor can tune the search and the results returned are reliable and relevant.

Some Webmasters rely on Google for this service. A listing in Google or another mainstream engine is a must-have in practical terms, so it is easy enough to piggyback on the main engine with a site-specific search. That works only if Google understands your site and keeps coming back for updates, which isn't always the case.

Large search engines boast of indexing billions of pages, but we are only interested in digesting a hundred pages or so. We need them indexed on a regular basis: daily, or at least more often than Google might do it.

It is also important to know whether our site responds correctly to a spider: serving public pages, hiding private pages, and letting links be followed correctly. Since Google uses algorithms that it doesn't share, we have no way of predicting the indexing results or doing any testing in advance. Advance testing is useful if, for example, you have private files that must not be indexed and you are relying on your robots.txt file to deny access to bots. If we make a spelling mistake in robots.txt, our private pages could go in Google's cache for the world to read. We also need to control what words are indexed and customize our own search and result pages.
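
The robots.txt risk is easy to demonstrate offline. The sketch below is not part of PhpDig; it uses Python's standard urllib.robotparser module to read a robots.txt the way a standards-following crawler would, then lists any private URLs that crawler would still be allowed to fetch. The URLs, the "ExampleBot" agent name, and the leaked_urls helper are made up for illustration, and the result says nothing about how Google or any other specific engine actually behaves.

```python
from urllib.robotparser import RobotFileParser


def leaked_urls(robots_txt, private_urls, agent="ExampleBot"):
    """Return the private URLs that `agent` would be allowed to fetch.

    `agent` is a hypothetical crawler name; any name matches a
    "User-agent: *" record.
    """
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return [url for url in private_urls if parser.can_fetch(agent, url)]


# Hypothetical private pages that must stay out of any index.
PRIVATE = [
    "http://www.example.com/private/notes.html",
    "http://www.example.com/private/drafts/plan.html",
]

# A correct file, and the same file with "Disallow" misspelled.
GOOD = "User-agent: *\nDisallow: /private/\n"
TYPO = "User-agent: *\nDisalow: /private/\n"

if __name__ == "__main__":
    for label, text in (("correct", GOOD), ("misspelled", TYPO)):
        print(label, leaked_urls(text, PRIVATE))
```

With the correct file the list of leaked URLs is empty. With the misspelled directive the parser silently ignores the line, so every private URL is reported as fetchable, which is exactly the kind of mistake worth catching before a public crawler visits.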

Enter PhpDig.
