The Sensitivity of Annotator Bias to Task Definitions in Argument Mining

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

Dokumenter

Fulltext
Forlagets udgivne version, 1,44 MB, PDF-dokument

NLP models are dependent on the data they are trained on, including how this data is annotated. NLP research increasingly examines the social biases of models, but often in the light of their training data and specific social biases that can be identified in the text itself. In this paper, we present an annotation experiment that is the first to examine the extent to which social bias is sensitive to how data is annotated. We do so by collecting annotations of arguments in the same documents following four different guidelines and from four different demographic annotator backgrounds. We show that annotations exhibit widely different levels of group disparity depending on which guidelines annotators follow. The differences are not explained by task complexity, but rather by characteristics of these demographic groups, as previously identified by sociological studies. We release a dataset that is small in the number of instances but large in the number of annotations with demographic information, and our results encourage an increased awareness of annotator bias.

Originalsprog	Engelsk
Titel	Proceedings of the 16th Linguistic Annotation Workshop, LAW 2022 - held in conjunction with the Language Resources and Evaluation Conference, LREC 2022 Workshop
Redaktører	Sameer Pradhan, Sandra Kubler
Forlag	European Language Resources Association (ELRA)
Publikationsdato	2022
Sider	44-61
ISBN (Elektronisk)	9782493814081
Status	Udgivet - 2022
Begivenhed	16th Linguistic Annotation Workshop, LAW 2022 - Marseille, Frankrig Varighed: 24 jun. 2022 → …

Konference

Konference	16th Linguistic Annotation Workshop, LAW 2022
Land	Frankrig
By	Marseille
Periode	24/06/2022 → …

Bibliografisk note

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk

Ingen data tilgængelig

ID: 333703780

Om Københavns Universitet