학술논문

Push-to-Trend: A Novel Framework to Detect Trend Promoters in Trending Hashtags
Document Type
Periodical
Source
IEEE Access Access, IEEE. 10:113005-113017 2022
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Market research
Social networking (online)
Blogs
Chatbots
Behavioral sciences
Feature extraction
Machine learning
User experience
Classification algorithms
Twitter trends
trend promoters
social media user classification
Twitter analytics
Language
ISSN
2169-3536
Abstract
Twitter trends have enabled the speedy dissemination of information with the ability to affect public opinion. Unfortunately, fake trends are also generated by malicious users to mislead the public. In general, Twitter users are studied in depth to identify humans, bots, spam, and fake accounts. However, artificial intelligence algorithms are not developed for the identification of ‘trend promoters’ generating fake trends. In this paper, we propose Push-To-Trend – a novel framework to detect ‘trend promoters’ in trending hashtags. For this purpose, first, we develop a dataset of TREP-21 containing 3,900 users labelled into two categories of ‘trend promoters’ and ‘normal users’. In addition, we design four discerning features of number of total tweets, duplicate tweets, overlapping ngram, and peak-to-mean ratio for trend promoters classification. Moreover, we thoroughly examine the features used for spam and bot accounts classification to filter three efficacious features for trend promoters identification. Leveraging these seven features, Push-To-Trend achieves the accuracy of 0.97 for TREP-21. Furthermore, we leverage our framework to identify and analyze trend promoters from the Urdu tweets repository “Anbar” which consists of 106.9 million tweets and 1.69 million users. The analysis of 602 most frequent hashtags in Anbar reveals that 15.7% of trend promoters generate 68.1% of total tweets related to hashtags. To the best of our knowledge, this is the first attempt to design machine learning models for the automatic classification of trend promoters. As such, our framework is generic and adaptable for tweets posted in different natural languages as it utilizes language-independent features.