학술논문

Toward a New Paradigm for Author Name Disambiguation
Document Type
Periodical
Source
IEEE Access Access, IEEE. 10:76055-76068 2022
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Semantics
Deep learning
Convolutional neural networks
Training
Support vector machines
Libraries
Collaboration
Author name disambiguation
digital library
deep learning
classification
word embedding
journal descriptor
semantic type
Language
ISSN
2169-3536
Abstract
Author Name Disambiguation (AND) has emerged as a significant challenge in the bibliometric context with the growing volume of scientific literature. When citations written by different authors have the same names (polysemy or homonym names), and when an author has different names, there is ambiguity (synonyms or name variants). It is difficult to associate a citation with the correct author. Polysemy and synonyms cause merging and splitting anomalies in the citations. These anomalies affect the quantification of an author’s productivity (bibliometric analysis) and the reliability and quality of the information retrieved. Many techniques for AND have been proposed in the literature; most of them do not go beyond string matching or text matching. Most of the existing work do not consider the context or semantics of the terms used in the citations. In this study, the AND problem is resolved semantically using the deep learning technique on the PubMed dataset. The experimental results show that the proposed method achieves overall (11.72 %, 12.5 %, and 12.1 %) higher precision, recall, and f-measure than the pairwise class classification.