학술논문

Exploiting heterogeneous scientific literature networks to combat ranking bias: Evidence from the computational linguistics area.
Document Type
Article
Source
Journal of the Association for Information Science & Technology. Jul2016, Vol. 67 Issue 7, p1679-1702. 24p. 3 Diagrams, 7 Charts, 8 Graphs.
Subject
*Algorithms
*Natural language processing
*Research funding
Linguistics
Citation analysis
Statistical models
Language
ISSN
2330-1635
Abstract
It is important to help researchers find valuable papers from a large literature collection. To this end, many graph-based ranking algorithms have been proposed. However, most of these algorithms suffer from the problem of ranking bias. Ranking bias hurts the usefulness of a ranking algorithm because it returns a ranking list with an undesirable time distribution. This paper is a focused study on how to alleviate ranking bias by leveraging the heterogeneous network structure of the literature collection. We propose a new graph-based ranking algorithm, Mutual Rank, that integrates mutual reinforcement relationships among networks of papers, researchers, and venues to achieve a more synthetic, accurate, and less-biased ranking than previous methods. Mutual Rank provides a unified model that involves both intra- and inter-network information for ranking papers, researchers, and venues simultaneously. We use the ACL Anthology Network as the benchmark data set and construct the gold standard from computer linguistics course websites of well-known universities and two well-known textbooks. The experimental results show that Mutual Rank greatly outperforms the state-of-the-art competitors, including Page Rank, HITS, Co Rank, Future Rank, and P- Rank, in ranking papers in both improving ranking effectiveness and alleviating ranking bias. Rankings of researchers and venues by Mutual Rank are also quite reasonable. [ABSTRACT FROM AUTHOR]