학술논문

A Metamodel Enabled Approach for Discovery of Coherent Topics in Short Text Microblogs
Document Type
Periodical
Source
IEEE Access Access, IEEE. 6:65582-65593 2018
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Twitter
Vocabulary
Data mining
Semantics
Noise measurement
Computational modeling
Social computing
topic coherence
short text mining
metamodel
Language
ISSN
2169-3536
Abstract
Comprehending social media discussions in short text microblogs is fundamental for knowledge-based applications like recommender systems. Twitter, for example, provides rich real-time information in keeping with its streaming nature. Making sense of such data without automated support is not feasible due to its vast size and nature. The problem becomes more complex when the data in question have a low variance in terms of topical diversity. Therefore, an automatic method for understanding textual patterns in such topically constrained data needs to be developed. A major challenge to building such a system is in its ability to comprehend the nature of the data with regard to diversity of word structure correlations, vocabulary sparsity, and distinguishing factors in the generated topics. In this paper, we present a novel semi-supervised approach called metamodel enabled latent Dirichlet allocation to address this challenge. Compared to state-of-the-art approaches, our model incorporates a domain-specific metamodel. The metamodel is defined as a set of topic label vectors derived from long texts to guide the learning process in shorter texts.