학술논문

Sentiment Analysis on ChatGPT Reviews
Document Type
Conference
Source
2024 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS) Electrical, Electronics and Computer Science (SCEECS), 2024 IEEE International Students' Conference on. :1-5 Feb, 2024
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Training
Measurement
Sentiment analysis
Analytical models
Reviews
Computational modeling
Machine learning
Predictive models
Chatbots
Task analysis
Smote
Bag of word
Supervised Learning
Random forest
Sentiment Analysis
Language
ISSN
2688-0288
Abstract
Sentiment analysis, a critical aspect of natural language processing, involves the identification and interpretation of emotions, opinions, and attitudes expressed in text. This review explores the innovative approach of employing ChatGPT, a prominent language model developed by OpenAI, for sentiment analysis tasks. This study employs the Bag-of-Word (BoW) technique to analyze sentiment. The research will also focus on creating models through machine learning methods. The dataset is obtained from the App Store, specifically targeting reviews related to ChatGPT. Filtering ensures only English reviews are considered, resulting in a dataset of 22,000 reviews. A preprocessing approach is applied for data cleaning before utilizing BoW for feature extraction. Subsequently, the study involves deploying, training, and assessing classifiers. Evaluation metrics are employed to measure classifier precision. Among the four classifiers tested, XGBOOST (XGB) excels when combined with BoW features, showcasing the highest accuracy. In the context of Bag-of-Words, the XGBOOST classifier achieves an 84% accuracy rate in data classification.