Wednesday, May 05, 2004

Google & Beyond

With the Google IPO impending, the geek talk is strongly focused on the search technology of the reigning king, Google, and on speculation about whether it will be able to retain its leadership position now that some big guns (Microsoft and Yahoo) are getting their act together in this space, alongside the small, tech-focused startups. If you are interested in this discussion, a must-read is the technologyreview.com article titled "Search Beyond Google", by Wade Roush.

Highlights of the article are:

Competitor 1: Mooter, a search engine from Down Under.
How does it work?
Before dumping a long list of links, Mooter analyses the potential meanings and permutations of the starting keywords and, behind the scenes, ranks the relevance of the resulting web pages within broad categories called clusters. The user first sees an on-screen starburst of cluster names.
Then comes the learning part: to develop a more precise understanding of what the user is probably looking for, the Mooter engine notes which clusters or links get clicked and uses that information to improve future results.
Mooter will personalize the ranking to reflect the apparent pattern of interest. A refined set of results appears on every page, and the engine continues to adjust the ranking based on the user's behavior.
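
To make the click-feedback idea concrete, here is a minimal sketch in Python. It is not Mooter's actual algorithm, which the article does not spell out; the function name `rerank`, the `boost` parameter, and the scoring formula are all assumptions made for illustration. Each result carries a cluster label and a base relevance score, and results in clusters the user has clicked more often get nudged upward.

```python
from collections import Counter

def rerank(results, clicked_clusters, boost=0.5):
    """Re-rank results using the clusters a user has clicked so far.

    results: list of (url, cluster, base_score) tuples.
    clicked_clusters: list of cluster names, one entry per click.
    boost: hypothetical weight controlling how strongly clicks matter.
    """
    counts = Counter(clicked_clusters)
    total = sum(counts.values()) or 1  # avoid dividing by zero with no clicks

    def personalized_score(item):
        _url, cluster, base_score = item
        # Scale the base score by the share of clicks this cluster received.
        return base_score * (1 + boost * counts[cluster] / total)

    return sorted(results, key=personalized_score, reverse=True)

results = [
    ("a.com/jaguar-cars", "cars", 0.9),
    ("b.org/jaguar-cats", "animals", 0.8),
    ("c.net/jaguar-os", "software", 0.7),
]
# The user clicked the "animals" cluster twice and "software" once.
for url, cluster, _ in rerank(results, ["animals", "animals", "software"]):
    print(cluster, url)
```

With no clicks recorded, the ordering falls back to the base scores, so the personalization only kicks in once the user has shown a pattern of interest.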

Competitor 2: Teoma
How does it work?
Before the Teoma engine presents the results for a given set of keywords, it identifies the associated communities and looks for authorities within them, that is, the pages that the community members' websites point to most often. Teoma tries to verify the credibility of these authority pages by checking whether they are listed on resource pages created by subject experts or enthusiasts, which tend to link to the best pages in the community. It then ranks the search results according to how often each page is cited by authority pages.
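
The three steps described above (find the most-linked pages, vet them against resource pages, rank by authority citations) can be sketched roughly as follows. This is only an illustration under my own assumptions, not Teoma's real implementation: the link graph is a plain dictionary, the `min_inlinks` threshold is invented, and the page names are made up.

```python
from collections import Counter

def find_authorities(links, resource_pages, min_inlinks=2):
    """Pick pages that the community links to often AND that a resource page lists.

    links: dict mapping each page to the list of pages it links to.
    resource_pages: pages curated by experts or enthusiasts.
    min_inlinks: hypothetical cutoff for "pointed to most often".
    """
    inbound = Counter(t for targets in links.values() for t in targets)
    candidates = {page for page, n in inbound.items() if n >= min_inlinks}
    vetted = {t for r in resource_pages for t in links.get(r, [])}
    return candidates & vetted

def rank_results(results, links, authorities):
    """Order results by how many authority pages cite each one."""
    citations = Counter(t for a in authorities for t in links.get(a, []))
    return sorted(results, key=lambda page: citations[page], reverse=True)

links = {
    "fan1": ["auth1", "auth2"],
    "fan2": ["auth1", "auth2", "spam"],
    "fan3": ["auth1", "spam"],
    "hub": ["auth1", "auth2"],      # expert-made resource page
    "auth1": ["pageA", "pageB"],
    "auth2": ["pageA"],
    "spam": ["pageC"],
}
authorities = find_authorities(links, resource_pages=["hub"])
print(rank_results(["pageC", "pageB", "pageA"], links, authorities))
```

Note how "spam" collects plenty of inbound links but never becomes an authority, because no resource page vouches for it, so the page it cites gets no credit.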

Competitor 3: Dipsie
How does it work?
By mining the deep web. What is the deep web? Content that is protected behind sign-up forms or stored in databases, such as product catalogues or legal or medical archives, and only assembled into web pages at the time of a user request.
Dipsie has built a nimble crawler that can get past forms and database interfaces to index the content.
When it becomes publicly available this summer, its index will include 10 billion documents, triple the size of Google's.
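
The article does not say how Dipsie's crawler gets past forms, so the following is just one generic way a crawler might reach database-backed pages: enumerate the combinations of a search form's field values and build the query URLs a user would have produced by submitting it. The function name, the example URL, and the field names are all hypothetical.

```python
from itertools import product
from urllib.parse import urlencode

def form_query_urls(action_url, fields):
    """Yield one GET URL per combination of candidate form values.

    action_url: the form's submission address (hypothetical here).
    fields: dict mapping each field name to a list of values to try.
    """
    names = sorted(fields)  # fixed order keeps the output reproducible
    for combo in product(*(fields[name] for name in names)):
        yield action_url + "?" + urlencode(dict(zip(names, combo)))

# Example: a product catalogue form with a category menu and a page number.
for url in form_query_urls(
    "http://example.com/catalog",
    {"category": ["books", "music"], "page": ["1", "2"]},
):
    print(url)
```

A real crawler would then fetch each URL and index the page that comes back; that part, along with politeness limits and login handling, is left out of this sketch.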

As for the giant (Microsoft) itself, details in the next blog post...


