Academic Article

Shortcut Learning Explanations for Deep Natural Language Processing: A Survey on Dataset Biases

Document Type

article

Author

Varun Dogra; Sahil Verma; Kavita; Marcin Wozniak; Jana Shafi; Muhammad Fazal Ijaz

Source

IEEE Access, Vol 12, Pp 26183-26195 (2024)

Subject

Dataset biases
deep learning
natural language processing
shortcut learning
transfer learning
Electrical engineering. Electronics. Nuclear engineering
TK1-9971

Language

English

ISSN

2169-3536

Abstract

The introduction of pre-trained large language models (LLMs) has transformed NLP: fine-tuning them on task-specific datasets has enabled notable advances in news classification, language translation, and sentiment analysis. However, growing recognition of bias in textual data has become a critical focus in the NLP community, revealing the inherent limitations of models trained on specific datasets. LLMs exploit these dataset biases and artifacts as expedient shortcuts for prediction, which hinders their generalizability and adversarial robustness. Addressing this issue is crucial to enhancing the reliability and resilience of LLMs in various contexts. This survey provides a comprehensive overview of the rapidly growing body of research on shortcut learning in language models, classifying it into four main areas: the factors of shortcut learning, the origin of bias, methods for detecting dataset biases, and strategies for mitigating them. The goal of this study is to offer a contextualized, in-depth look at the state of learning models, highlighting the major areas of attention and suggesting possible directions for further research.
