Language Selection

English French German Italian Portuguese Spanish

PhpDig excels at small Web site indexing

Filed under
HowTos

Webmasters looking to provide search capabilities for their site would do well to try out PhpDig, a Web spider and search engine written in PHP with a MySQL backend. There are other open source search engines, all of which have their own advantages. PhpDig just happens to suit the needs of my Information Technology for Greenhouses and Horticulture site. Here's how I got it working.

Webmasters with small sites know the problem of providing useful site search capabilities. Typically, visitors enter keywords in a search box and the search engine returns a ranked list of pages related to the query. This is a useful service -- provided the visitor can tune the search and the results returned are reliable and relevant.

Some Webmasters rely on Google for this service. A listing in Google or another mainstream engine is a must-have in practical terms, so it is easy enough to piggyback on the main engine with a site-specific search, provided Google understands your site and keeps coming back for updates -- but this isn't always the case.

Large search engines boast of indexing of billions of pages, but we are only interested in digesting a hundred pages or so. We need them indexed on a regular basis, daily or at least more often than Google might do it.

It is also important to know if our site is responding correctly by providing public pages, hiding private pages, and following links correctly. Since Google uses algorithms that it doesn't share, we have no way of predicting the indexing results or doing any testing in advance. Advance testing is useful if, for example, you have private files that you want to be sure will not be indexed, but you are relying on your robots.txt file to deny access to bots. If we make a spelling mistake in robots.txt, our private pages could go in Google's cache for the world to read. We also need to control what words are indexed and customize our own search and result pages.

Enter PhpDig.

More in Tux Machines

NVIDIA Linux OpenCL Performance vs. Radeon ROCm / AMDGPU-PRO

Earlier this week I posted some benchmarks of GPUOpen's new Radeon Open Compute ROCm OpenCL stack that premiered last month and they are working to make completely open-source. In those initial benchmarks I compared the ROCm 1.4 OpenCL performance to the existing AMDGPU-PRO OpenCL implementation on Linux. For those wondering how these two Radeon OpenCL stacks compare to NVIDIA, here are some fresh benchmarks. Read more

Leftovers: Ubuntu and Debian

  • Download Links & Torrents for Debian 8.7 GNU/Linux
    Debian 8.7 GNU/Linux has been released at January 14th 2017. This is an update for Debian 8 (stable, Jessie) mainly for fixing security issues. Here I listed download links for 64 bit and 32 bit versions including torrent links. This article is intended as simple guide for new comers into Debian.
  • This Dev Is Working on a Way to Run Android Apps on Ubuntu Phone
    I’m writing about this way too early, but I figured it may help stoke a few fading hearts among the Ubuntu Phone faithful in light of recent news. Ubports developer (and all round awesome dude) Marius Grispgård has revealed that he’s working on a way to run Android apps on Ubuntu Phone.
  • Ubuntu 17.04 Zesty Zapus Release Schedule
    Ubuntu 17.04, which has got codename 'Zesty Zapus', is currently penciled in to ship on 13th April, 2017. The release date for Ubuntu 17.04 has now been firmed up as are the other development milestones leading up to the mid-April, currently we know that Unity 8 is going to be the interesting feature which will be shipped in 17.04 and swap partitions will likely to be replaced by swap files as mentioned by Canonical's Dimitri John Ledkov, and rest what's new coming in this release we don't know.

today's howtos

GNU/Linux and Servers