Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
Abstract
In the field of language modeling, models augmented with retrieval components have emerged as a promising solution to address several challenges faced in the natural language processing (NLP) field, including knowledge grounding, interpretability, and scalability. Despite the primary focus on NLP, we posit that the paradigm of retrieval-enhancement can be extended to a broader spectrum of machine learning (ML) such as computer vision, time series prediction, and computational biology. Therefore, this work introduces a formal framework of this paradigm, Retrieval-Enhanced Machine Learning (REML), by synthesizing the literature in various domains in ML with consistent notations which is missing from the current literature. Also, we found that while a number of studies employ retrieval components to augment their models, there is a lack of integration with foundational Information Retrieval (IR) research. We bridge this gap between the seminal IR research and contemporary REML studies by investigating each component that comprises the REML framework. Ultimately, the goal of this work is to equip researchers across various disciplines with a comprehensive, formally structured framework of retrieval-enhanced models, thereby fostering interdisciplinary future research.
Community
We extend Retrieval Augmented Generation (RAG) and similar techniques to the broader AI community in a unifying framework we call Retrieval Enhanced Machine Learning (REML). Compared to the previous REML paper, this one is updated with recent techniques representative of each module in REML, and also provides an update outlook for REML-related research opportunities.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Large Language Model Enhanced Knowledge Representation Learning: A Survey (2024)
- Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI (2024)
- A Multi-Source Retrieval Question Answering Framework Based on RAG (2024)
- Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning (2024)
- A Survey of Generative Information Retrieval (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper