Fine-tune the Entire RAG Architecture (including DPR retriever) for Question-Answering

In this paper, we illustrate how to fine-tune the entire Retrieval Augment Generation (RAG) architecture in an end-to-end manner.

Open

Preview
Year: 2021
Venue: fine-tune-the-entire-rag-architecture-1
ArXiv: arxiv.org/abs/2106.11517
Authors: 4
Hosting: Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2106.11517ARXIV-DEFAULT
TL;DR: Semantic Scholar

Attribution policy →

Abstract

In this paper, we illustrate how to fine-tune the entire Retrieval Augment Generation (RAG) architecture in an end-to-end manner. We highlighted the main engineering challenges that needed to be addressed to achieve this objective. We also compare how end-to-end RAG architecture outperforms the original RAG architecture for the task of question answering. We have open-sourced our implementation in the HuggingFace Transformers library.

Authors

Shamane Siriwardhana Rivindu Weerasekera Elliott Wen Suranga Nanayakkara