학술논문

Generating and Testing Synthetic Datasets for Recommender Systems to Improve Fairness in Collaborative Filtering Research
Document Type
Conference
Source
2023 20th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA) Computer Systems and Applications (AICCSA), 2023 20th ACS/IEEE International Conference on. :1-6 Dec, 2023
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Codes
Collaborative filtering
Neural networks
Software
Reproducibility of results
Data models
Recommender systems
Fairness
Recommender Systems
Synthetic Dataset
Collaborative Filtering
Demographic Features
Language
ISSN
2161-5330
Abstract
Fairness is an important field in the Recommender Systems area since collaborative filtering datasets tend to be demographically biased. This paper proposes a parameterized model and provides an open repository of code to generate synthetic datasets containing demographic information. The model parameters can be set to hold different numbers of minority and nonminority users, distributions, and their overlapping. A neural network model has also been used to test the accuracy obtained in different scenarios, by setting the number of minority users and the overlapping between minority and nonminority distributions of votes. The results show how minority users receive unfair recommendations, particularly when their number decreases and when the distributions of minority versus nonminority users partially overlap. This research can be easily extended by designing more sophisticated models and modifying the provided framework; in this sense, a specific future work has been proposed.