학술논문

Automated Categorization of Privacy Policies Based on User Perspective
Document Type
Conference
Source
2021 10th International Conference on Information and Automation for Sustainability (ICIAfS) Information and Automation for Sustainability (ICIAfS), 2021 10th International Conference on. :54-59 Aug, 2021
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Support vector machines
Data privacy
Privacy
Transfer learning
Computer architecture
Knowledge discovery
Mobile applications
Privacy Policy
Usable Data Privacy
Text Classification
Natural Language Processing
Mobile Computing
Language
ISSN
2151-1810
Abstract
Data privacy deals with the sensitive information of individuals and has become a major topic in modern society. Although the practicability of mobile apps has become an essential part of the routines of many people, consumers are increasingly concerned about their data privacy. Most of the time, these privacy policies that are used to collect and share data are lengthy and complex to understand. Therefore, a common user finds it hard to understand before agreeing to the privacy policy terms. This study aims to propose an approach to convey privacy policies in a way that a common user can comprehend. In this paper, we present 10 categories to classify headers and sections of privacy policies, selected after considering both the users’ and domain experts’ views, and an automated privacy policy classification model. Bert, SVM, Naive Bayes, and BiLSTM models are used as the baseline classification models. Our best performing model Bert shows an F1-score of 81%. A dataset of 11K headers and sections of privacy policies classified under the 10 categories and high-level architecture of the privacy policy question answering model are also presented here. Thus, the proposed solution makes awareness and gives an insight to mobile app users on their data privacy.