Deep neural learning on weighted datasets utilizing label disagreement from crowdsourcing

Dongsheng Wang, Prayag Tiwari, Mohammad Shorfuzzaman, Ingo Schmitt*

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

5 Citationer (Scopus)
9 Downloads (Pure)

Abstract

Experts and crowds can work together to generate high-quality datasets, but such collaboration is limited to a large-scale pool of data. In other words, training on a large-scale dataset depends more on crowdsourced datasets with aggregated labels than expert intensively checked labels. However, the limited amount of high-quality dataset can be used as an objective test dataset to build a connection between disagreement and aggregated labels. In this paper, we claim that the disagreement behind an aggregated label indicates more semantics (e.g. ambiguity or difficulty) of an instance than just spam or error assessment. We attempt to take advantage of the informativeness of disagreement to assist learning neural networks by computing a series of disagreement measurements and incorporating disagreement with distinct mechanisms. Experiments on two datasets demonstrate that the consideration of disagreement, treating training instances differently, can promisingly result in improved performance.

OriginalsprogEngelsk
Artikelnummer108227
TidsskriftComputer Networks
Vol/bind196
Antal sider7
ISSN1389-1286
DOI
StatusUdgivet - 2021

Bibliografisk note

Publisher Copyright:
© 2021 The Authors

Citationsformater