Academic Paper

Web search via hub synthesis
Document Type
Conference
Source
Proceedings of the 42nd IEEE Symposium on Foundations of Computer Science, 2001, pp. 500-509
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Web search
Computer science
Web pages
Couplings
Information retrieval
Books
Humans
Search engines
Embedded computing
Language
ISSN
1552-5244
Abstract
We present a model for web search that captures in a unified manner three critical components of the problem: how the link structure of the web is generated, how the content of a web document is generated, and how a human searcher generates a query. The key to this unification lies in capturing the correlations between these components in terms of proximity in a shared latent semantic space. Given such a combined model, the correct answer to a search query is well defined, and thus it becomes possible to evaluate web search algorithms rigorously. We present a new web search algorithm, based on spectral techniques, and prove that it is guaranteed to produce an approximately correct answer in our model. The algorithm assumes no knowledge of the model, and is well-defined regardless of the model's accuracy.
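As an illustration of what a "spectral technique" for link analysis can look like, here is a minimal sketch of HITS-style hub and authority scoring computed by power iteration. This is not the algorithm proposed in the paper, which the abstract does not specify. The function names `normalize` and `hits_scores` and the toy link matrix are hypothetical and chosen only for this example.

```python
from math import sqrt


def normalize(vec):
    """Scale a vector to unit Euclidean length (left unchanged if all zeros)."""
    norm = sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def hits_scores(adj, iterations=50):
    """Approximate the principal singular vectors of a link matrix.

    adj[i][j] == 1 means page i links to page j (an assumed encoding).
    Returns (hubs, authorities), each a unit-length list of scores.
    """
    n = len(adj)
    hubs = [1.0] * n
    auths = [1.0] * n
    for _ in range(iterations):
        # A page is a good authority if good hubs point to it.
        auths = normalize(
            [sum(adj[i][j] * hubs[i] for i in range(n)) for j in range(n)]
        )
        # A page is a good hub if it points to good authorities.
        hubs = normalize(
            [sum(adj[i][j] * auths[j] for j in range(n)) for i in range(n)]
        )
    return hubs, auths


# Toy web of four pages: page 0 links to pages 2 and 3, page 1 links to page 2.
links = [
    [0, 0, 1, 1],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

hubs, auths = hits_scores(links)
ranking = sorted(range(len(links)), key=lambda page: auths[page], reverse=True)
print("authority ranking:", ranking)
```

In this toy graph, page 2 receives links from both linking pages and so ends up with the highest authority score, while page 0 (which links to both targets) ends up as the strongest hub. A production system would use a sparse eigensolver rather than dense Python lists.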