Advanced Metasearch Engine Technology by Weiyi Meng, Clement Yu, M. Tamer Ozsu

By Weiyi Meng, Clement Yu, M. Tamer Ozsu

One of the seek instruments at the moment on the net, se's are the main popular due to the recognition of significant se's resembling Google and Yahoo!. whereas tremendous winning, those significant se's do have critical obstacles. This publication introduces large-scale metasearch engine expertise, which has the aptitude to beat the restrictions of the various search engines. primarily, a metasearch engine is a seek procedure that helps unified entry to a number of present se's via passing the queries it gets to its part se's and aggregating the again effects right into a unmarried ranked checklist. A large-scale metasearch engine has hundreds of thousands or extra part se's. whereas metasearch engines have been before everything encouraged through their skill to mix the quest insurance of a number of se's, there also are different advantages comparable to the aptitude to acquire higher and brisker effects and to arrive the Deep internet. the subsequent significant elements of large-scale metasearch engines can be mentioned intimately during this e-book: seek engine choice, seek engine incorporation, and consequence merging. hugely scalable and automatic recommendations for those elements are emphasised. The authors make a powerful case for the viability of the large-scale metasearch engine know-how as a aggressive expertise for internet seek. desk of Contents: creation / Metasearch Engine structure / seek Engine choice / seek Engine Incorporation / outcome Merging / precis and destiny learn

Show description

Read or Download Advanced Metasearch Engine Technology PDF

Best human-computer interaction books

The Social and Cognitive Impacts of e-Commerce on Modern Organizations

The Social and Cognitive affects of E-Commerce on smooth companies comprises articles addressing the social, cultural, organizational, and cognitive affects of e-commerce applied sciences and advances on corporations world wide. having a look particularly on the affects of digital trade on shopper habit, in addition to the effect of e-commerce on organizational habit, improvement, and administration in corporations.

Handbook of Research on Urban Informatics: The Practice and Promise of the Real-Time City

Alive with circulation and pleasure, towns transmit a speedy stream of trade facilitated through a meshwork of infrastructure connections. during this atmosphere, the net has complex to turn into the top communique medium, making a vivid and more and more researched box of research in city informatics.

Ubiquitous and Pervasive Computing: Concepts, Methodologies, Tools, and Applications

With the improvement of ubiquitous and pervasive computing, elevated and elevated adaptability to altering wishes, personal tastes, and environments will emerge to additional increase using expertise among worldwide cultures and populations. Ubiquitous and Pervasive Computing: innovations, Methodologies, instruments, and functions covers the newest cutting edge examine findings concerned with the incorporation of applied sciences into daily facets of existence from a collaboration of comprehensive box specialists.

Beginning CSS Preprocessors: With Sass, Compass, and Less

Learn the way preprocessors could make CSS scalable and simple to keep up. you will see the best way to write code in a truly fresh and scalable demeanour and use CSS preprocessor good points comparable to variables and looping, that are lacking in CSS natively. analyzing starting CSS Preprocessors will make your existence a lot easier via displaying you ways to create reusable chunks of code.

Extra resources for Advanced Metasearch Engine Technology

Sample text

In practice, a large number of factors may be considered by a similarity function, and the ranking formula can become very complicated. , 2008). 3. CHALLENGING ENVIRONMENT 33 8. Document Version. Individual documents on the Web may be modified anytime by their authors or even automatically by some software. Typically, when a Web page is modified, those search engines that indexed the Web page will not be notified of the modification. Before a search engine can re-fetch and re-index a modified Web page, the representation of the Web page in the search engine is based on a stale or out-dated version of the page.

3. CHALLENGING ENVIRONMENT 31 As we have discussed before, there are significant technical challenges in building very largescale metasearch engines like WebScales. Although much research is still needed, a lot of progress has already been made. In the next three chapters, some of these progresses will be reported. , they are built and maintained independently. The developers of each search engine decide what documents the search engine will provide search services to, how documents should be represented, and when the index should be updated.

As large-scale metasearch engines use only specialized search engines as their component search engines, the contents they can search should also have better quality. Major search engines rely on their crawlers to collect documents from numerous Web servers. However, these crawlers cannot keep up with the fast changing Web contents due to the huge numbers of Web pages and Web servers involved, as well as the constantly changing nature of the Web. It typically takes from several days to couple of weeks for newly updated or added contents to be crawled or re-crawled.

Download PDF sample

Rated 4.62 of 5 – based on 35 votes