Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-Checkers

Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, Arnav Arora, Aleksandr Rubashevskii, Jiahui Geng, Osama Mohammed Afzal, Liangming Pan, Nadav Borenstein, Aditya Pillai, Isabelle Augenstein, Iryna Gurevych, Preslav Nakov

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

5 Citationer (Scopus)
2 Downloads (Pure)

Abstract

The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs. In this work, we present Factcheck-Bench, a holistic end-to-end framework for annotating and evaluating the factuality of LLM-generated responses, which encompasses a multi-stage annotation scheme designed to yield detailed labels for fact-checking and correcting not just the final prediction, but also the intermediate steps that a fact-checking system might need to take. Based on this framework, we construct an open-domain factuality benchmark in three-levels of granularity: claim, sentence, and document. We further propose a system, Factcheck-GPT, which follows our framework, and we show that it outperforms several popular LLM fact-checkers. We make our annotation tool, benchmark, and code available at https://github.com/yuxiaw/Factcheck-GPT.

OriginalsprogEngelsk
TitelEMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024
RedaktørerYaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Antal sider32
ForlagAssociation for Computational Linguistics (ACL)
Publikationsdato2024
Sider14199-14230
ISBN (Elektronisk)9798891761681
DOI
StatusUdgivet - 2024
Begivenhed2024 Findings of the Association for Computational Linguistics, EMNLP 2024 - Hybrid, Miami, USA
Varighed: 12 nov. 202416 nov. 2024

Konference

Konference2024 Findings of the Association for Computational Linguistics, EMNLP 2024
Land/OmrådeUSA
ByHybrid, Miami
Periode12/11/202416/11/2024
SponsorApple, Bloomberg, Citadel Securities, et al., Google , Meta

Bibliografisk note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Citationsformater