학술논문

Sentiment Analysis on Tweets Using Machine Learning and Combinatorial Fusion
Document Type
Conference
Source
2019 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech) DASC-PICOM-CBDCOM-CYBERSCITECH Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech), 2019 IEEE Intl Conf on. :1066-1071 Aug, 2019
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Classification algorithms
Machine learning algorithms
Support vector machines
Sentiment analysis
Data models
Diversity, Combinatorial Fusion, Machine Learning, rank score characteristic (RSC) function, and, sentiment analysis
Language
Abstract
Sentiment analysis using social network platforms such as twitter has achieved tremendous results. However, due to its imbalanced data content and semantic context, it remains a challenge to give a full and effective sentiment labeling. In this paper, we propose a two-stage data analytic approach consisting of machine learning algorithms and combinatorial fusion. The first stage uses five machine learning algorithms: logistic regression, naive Bayes, perceptron, random forest, and support vector machine (SVM). Combinatorial fusion is then used to combine subset of these five algorithms. We conduct our investigation using a Kaggle dataset to classify each of the tweets as positive, neutral, or negative sentiment. We demonstrate that although most of the machine learning algorithms perform well, combination of these algorithms with higher performance ratio and cognitive diversity can perform even better.