Abstract
Retrieval-augmented models have proven to be effective in natural language processing tasks, yet there remains a lack of research on their optimization using variational inference. We introduce the Variational Open-Domain (VOD) framework for end-to-end training and evaluation of retrieval-augmented models, focusing on open-domain question answering and language modelling. The VOD objective, a self-normalized estimate of the Rényi variational bound, approximates the task marginal likelihood and is evaluated under samples drawn from an auxiliary sampling distribution (cached retriever and/or approximate posterior). It remains tractable, even for retriever distributions defined on large corpora. We demonstrate VOD's versatility by training reader-retriever BERT-sized models on multiple-choice medical exam questions. On the MedMCQA dataset, we outperform the domain-tuned Med-PaLM by +5.3% despite using 2.500× fewer parameters. Our retrieval-augmented BioLinkBERT model scored 62.9% on the MedMCQA and 55.0% on the MedQA-USMLE. Last, we show the effectiveness of our learned retriever component in the context of medical semantic search. © 2023 Proceedings of Machine Learning Research. All rights reserved.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Proceedings of Machine Learning Research |
Vol/bind | 202 |
Sider (fra-til) | 20950-20977 |
Antal sider | 28 |
ISSN | 2640-3498 |
Status | Udgivet - 2023 |
Begivenhed | 40th International Conference on Machine Learning, ICML 2023 - Honolulu, USA Varighed: 23 jul. 2023 → 29 jul. 2023 |
Konference
Konference | 40th International Conference on Machine Learning, ICML 2023 |
---|---|
Land/Område | USA |
By | Honolulu |
Periode | 23/07/2023 → 29/07/2023 |
Bibliografisk note
Funding Information:VL’s work was funded in part by Google DeepMind through a PhD grant. OW’s work was funded in part by the Novo Nordisk Foundation through the Center for Basic Machine Learning Research in Life Science (NNF20OC0062606). VL and OW acknowledge support from the Pioneer Centre for AI, DNRF grant number P1.
Publisher Copyright:
© 2023 Proceedings of Machine Learning Research. All rights reserved.