Toward Evaluating the Reproducibility of Information Retrieval Systems with Simulated Users

Timo Breuer, Maria Maistro

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

Reproducibility is a fundamental part of scientific progress. Compared to other scientific fields, computational sciences are privileged as experimental setups can be preserved with ease, and regression experiments allow the validation of computational results by bitwise similarity. When evaluating information access systems, the system users are often considered in the experiments, be it explicit as part of user studies or implicit as part of evaluation measures. Usually, system-oriented Information Retrieval (IR) experiments are evaluated with effectiveness measurements and batches of multiple queries. Successful reproduction of an IR system is often determined by how well it approximates the averaged effectiveness of the original (reproduced) system. Earlier work suggests that the naïve comparison of average effectiveness hides differences that exist between the original and reproduced systems. Most importantly, such differences can affect the recipients of the retrieval results, i.e., the system users. To this end, this work sheds light on what implications for users may be neglected when a system-oriented IR experiment is prematurely considered reproduced. Based on simulated reimplementations with comparable effectiveness as the reference system, we show what differences are hidden behind averaged effectiveness scores. We discuss possible future directions and consider how these implications could be addressed with user simulations.

Original languageEnglish
Title of host publicationProceedings of the 2nd ACM Conference on Reproducibility and Replicability, REP 2024
Number of pages5
PublisherAssociation for Computing Machinery, Inc.
Publication date2024
Pages25-29
ISBN (Electronic)9798400705304
DOIs
Publication statusPublished - 2024
Event2nd ACM Conference on Reproducibility and Replicability, REP 2024 - Rennes, France
Duration: 18 Jun 202420 Jun 2024

Conference

Conference2nd ACM Conference on Reproducibility and Replicability, REP 2024
Country/TerritoryFrance
CityRennes
Period18/06/202420/06/2024
SponsorBrittany Region, EIGREP, Inria, Rennes Metropole, Sandia National Laboratory, University de Rennes

Bibliographical note

Publisher Copyright:
© 2024 Copyright held by the owner/author(s).

Keywords

  • information access
  • reproducibility
  • system users

Cite this