Academic Article

Inherent Disagreements in Human Textual Inferences

Document Type

article

Author

Pavlick, Ellie; Kwiatkowski, Tom

Source

Transactions of the Association for Computational Linguistics, Vol 7, Pp 677-694 (2019)

Subject

Computational linguistics. Natural language processing
P98-98.5

Language

English

ISSN

2307-387X

Abstract

We analyze humans’ disagreements about the validity of natural language inferences. We show that, very often, disagreements are not dismissible as annotation “noise”, but rather persist as we collect more ratings and as we vary the amount of context provided to raters. We further show that the type of uncertainty captured by current state-of-the-art models for natural language inference is not reflective of the type of uncertainty present in human disagreements. We discuss implications of our results in relation to the recognizing textual entailment (RTE)/natural language inference (NLI) task. We argue for a refined evaluation objective that requires models to explicitly capture the full distribution of plausible human judgments.

Online Access

Full Text (ProQuest Central); Open Access (DOAJ); Web of Science; Find it@PNU
