학술논문
Web search via hub synthesis
Document Type
Conference
Author
Source
Proceedings 42nd IEEE Symposium on Foundations of Computer Science Cluster Computing, 2001. Proceedings. 2001 IEEE International Conference on. :500-509 2001
Subject
Language
ISSN
1552-5244
Abstract
We present a model for web search that captures in a unified manner three critical components of the problem: how the link structure of the web is generated, how the content of a web document is generated, and how a human searcher generates a query. The key to this unification lies in capturing the correlations between these components in terms of proximity in a shared latent semantic space. Given such a combined model, the correct answer to a search query is well defined, and thus it becomes possible to evaluate web search algorithms rigorously. We present a new web search algorithm, based on spectral techniques, and prove that it is guaranteed to produce an approximately correct answer in our model. The algorithm assumes no knowledge of the model, and is well-defined regardless of the model's accuracy.