Language Selection

English French German Italian Portuguese Spanish

The Evolution Of Web Search

Filed under
Web

As we've become completely immersed in the Internet era, Google has become a verb, librarians are increasingly lonely, and most of us have mastered the basics of Boolean logic--without even knowing exactly what it is. We've become a society of information managers, navigating huge amounts of data with ease and expertly tracking down obscure facts and figures.

But as far as we've come, all we've really done is become good at finding needles in haystacks. There's no sophistication, no wisdom involved, and it's largely because our search tools are pretty dumb.

Imagine you were suffering from a bad case of tennis elbow and wanted to find a doctor who could see you on Saturday. A simple Google (nasdaq: GOOG - news - people ) search for "doctors" would find some referral services, but it would also produce pages of doctor jokes and medical associations. More significantly, you'd miss all kinds of "physicians" and "therapists" who might be able to help, simply because you didn't choose that word. Search on "tennis elbow" and you're not going to find help for "athletic injuries." And searching for offices that are "open Saturdays" won't help you find the ones with "weekend hours."

To solve that problem, we need a search system that doesn't just process and parse our language, but understands it; programs that don't just match your search terms but intuitively recognize context to deliver what you're really looking for. Fortunately, engineers and researchers around the world are already at work to bring about this system, and they call it the semantic Web.

Conceived by Tim Berners-Lee, a computer scientist generally considered the father of the World Wide Web, the semantic Web isn't an entirely new network. It's a vision of a world where "tags," or code, is hidden inside Web pages to help computers understand meaning. Individual terms like "doctor" would be tagged with identifying code allowing a program reading the document to refer back to a central dictionary and learn that a "doctor" is the same as a "physician."

But the semantic Web isn't just a fancy thesaurus. It also defines the relationships between words, allowing a program to understand that "price" is measured in "dollars," which can be converted into "yen," and that both of those words refer to different kinds of "money."

The resulting system bears the same relationship to today's Web as a pile of books does to a well-cataloged library. "The World Wide Web, as we know it today, is mostly unstructured content," says Burton Group analyst Peter O'Kelly. "The general idea is to infuse more meaning, try to provide more of a sense of structure about the world."

Full Story.

More in Tux Machines

Embedded Development with ARM mbed on Linux

The ARM CPU used in the BeagleBone Black and other single board computers is designed to interface with half to a few gigabytes of RAM and allow a full operating system such as Linux to be run on the computer. (See my long series of reviews on Linux.com of ARM-based computers that run Linux). By contrast the ARM Cortex-M is a microcontroller level chip which might run at 16-100Mhz, contain 2-100kb of RAM, and some flash memory to contain only the program that you want to execute. Read more

Confessions of a systems librarian

These are just two examples of serious flagship projects, but even on a day-to-day level there are plenty of opportunities for systems librarians to interact with open source software. A large amount of vendor software runs on Linux, so there’s plenty of systems administration to do. I work in a relatively small library, and even here we run five Drupal websites: one as a portal for library services, one as the primary repository for our archive, another provides the public interface for an aboriginal research center, and one to manage safety information for our bio sciences lab. Read more

IoT, Revolution In Personal Computing, or FLOSS Taking Over the World of IT

What can you say? In a few short years, that other OS has gone from mainstream to niche and Android/Linux and GNU/Linux are stepping up to displace it as the goto OS of the world. It’s all good. This is the right way to do IT with the world making its own software throughout the whole stack: OS on client and server and a ton of applications too. There is no need for a monopoly in IT. The world wants a revolution not lock-in. Read more

Political parties favour openness to reconstruct Greek productivity

Ahead of the parliamentary elections in Greece last week, the Greek Free/Open Source Software Society (GFOSS) contacted all political parties to ask about their positions [in Greek] with regard to open software, open data, open hardware and open government. The four parties to respond all came out generally in favour of openness. Some of them were even able to present very detailed planning on how to improve the current institutional and legislative framework and outlined how openness could help reconstruct Greek productivity. Read more