Group Fairness in Multilingual Speech Recognition Models

Anna Katrine van Zee, Marc van Zee, Anders Søgaard

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

4 Downloads (Pure)

Abstract

We evaluate the performance disparity of the Whisper and MMS families of ASR models across the VoxPopuli and Common Voice multilingual datasets, with an eye toward intersectionality. Our two most important findings are that model size, surprisingly, correlates logarithmically with worst-case performance disparities, meaning that larger (and better) models are less fair. We also observe the importance of intersectionality. In particular, models often exhibit significant performance disparity across binary gender for adolescents.
OriginalsprogEngelsk
TitelFindings of the Association for Computational Linguistics: NAACL 2024
ForlagAssociation for Computational Linguistics
Publikationsdato2024
Sider2213–2226
StatusUdgivet - 2024
Begivenhed2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 - Hybrid, Mexico City, Mexico
Varighed: 16 jun. 202421 jun. 2024

Konference

Konference2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Land/OmrådeMexico
ByHybrid, Mexico City
Periode16/06/202421/06/2024
SponsorBaidu, Capital One, et al., Grammarly, Megagon Labs, Otter.ai

Citationsformater