학술논문

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference
Document Type
Working Paper
Source
Subject
Computer Science - Computation and Language
Language
Abstract
Popular Natural Language Inference (NLI) datasets have been shown to be tainted by hypothesis-only biases. Adversarial learning may help models ignore sensitive biases and spurious correlations in data. We evaluate whether adversarial learning can be used in NLI to encourage models to learn representations free of hypothesis-only biases. Our analyses indicate that the representations learned via adversarial learning may be less biased, with only small drops in NLI accuracy.
Comment: StarSem 2019 - The Eighth Joint Conference on Lexical and Computational Semantics