MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering

Zhang Siyue,Yuxiang Xue,Yiming Zhang,Xiaobao Wu,Anh Tuan Luu,Zhaoyu Chen

Published 2024 in arXiv.org

ABSTRACT

Understanding temporal relations and answering time-sensitive questions is crucial yet a challenging task for question-answering systems powered by large language models (LLMs). Existing approaches either update the parametric knowledge of LLMs with new facts, which is resource-intensive and often impractical, or integrate LLMs with external knowledge retrieval (i.e., retrieval-augmented generation). However, off-the-shelf retrievers often struggle to identify relevant documents that require intensive temporal reasoning. To systematically study time-sensitive question answering, we introduce the TempRAGEval benchmark, which repurposes existing datasets by incorporating temporal perturbations and gold evidence labels. As anticipated, all existing retrieval methods struggle with these temporal reasoning-intensive questions. We further propose Modular Retrieval (MRAG), a trainless framework that includes three modules: (1) Question Processing that decomposes question into a main content and a temporal constraint; (2) Retrieval and Summarization that retrieves evidence and uses LLMs to summarize according to the main content; (3) Semantic-Temporal Hybrid Ranking that scores each evidence summarization based on both semantic and temporal relevance. On TempRAGEval, MRAG significantly outperforms baseline retrievers in retrieval performance, leading to further improvements in final answer accuracy.

PUBLICATION RECORD

Publication year
2024
Venue
arXiv.org
Publication date
2024-12-20
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2412.15540 arXiv 2412.15540
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
2025cited by this paper
Search-o1: Agentic Search-Enhanced Large Reasoning Models
2025cited by this paper
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
2025cited by this paper
GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments
2025cited by this paper
Tug-of-War between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
2024cited by this paper
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
2024cited by this paper
TimeR^4 : Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
2024cited by this paper
Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback
2024cited by this paper
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice
2024cited by this paper
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
2024cited by this paper
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
2024cited by this paper
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
2024cited by this paper
ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering
2024cited by this paper
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
2024influential reference
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
2024cited by this paper
AKEW: Assessing Knowledge Editing in the Wild
2024cited by this paper
Local and Global: Temporal Question Answering via Information Fusion
2023cited by this paper
C-Pack: Packaged Resources To Advance General Chinese Embedding
2023cited by this paper
Lost in the Middle: How Language Models Use Long Contexts
2023cited by this paper
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
2023influential reference
ClueWeb22: 10 Billion Web Documents with Rich Information
2022cited by this paper
StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models
2022cited by this paper
Temporal knowledge graph question answering via subgraph reasoning
2022cited by this paper
Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data
2022cited by this paper
Question Answering Over Temporal Knowledge Graphs
2021cited by this paper
Unsupervised Dense Information Retrieval with Contrastive Learning
2021influential reference
Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation
2021influential reference
Complex Temporal Question Answering on Knowledge Graphs
2021cited by this paper
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
2021influential reference
A Dataset for Answering Time-Sensitive Questions
2021cited by this paper
Time-Aware Language Models as Temporal Knowledge Bases
2021cited by this paper
Dense Passage Retrieval for Open-Domain Question Answering
2020cited by this paper
A Memory Efficient Baseline for Open Domain Question Answering
2020cited by this paper
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
2020cited by this paper
Evaluating Models’ Local Decision Boundaries via Contrast Sets
2020influential reference
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
2020cited by this paper
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
2020cited by this paper
MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
2020influential reference
Language Models are Unsupervised Multitask Learners
2019cited by this paper
大規模要約資源としてのNew York Times Annotated Corpus
2015cited by this paper
Language Models
2009cited by this paper
Information Retrieval
2008cited by this paper

CITED BY

Efficient Temporal-aware Matryoshka Adaptation for Temporal Information Retrieval
2026cites this paper
Analyzing Diffusion and Autoregressive Vision Language Models in Multimodal Embedding Space
2026cites this paper
DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes
2025cites this paper
It's High Time: A Survey of Temporal Information Retrieval and Question Answering
2025cites this paper
mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation
2025cites this paper
Reading Between the Timelines: RAG for Answering Diachronic Questions
2025influential citation
Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models
2025cites this paper
A Question Answering Dataset for Temporal-Sensitive Retrieval-Augmented Generation
2025cites this paper
Re3: Learning to Balance Relevance & Recency for Temporal Information Retrieval
2025cites this paper
RAG Meets Temporal Graphs: Time-Sensitive Modeling and Retrieval for Evolving Knowledge
2025cites this paper
It's High Time: A Survey of Temporal Question Answering
2025cites this paper
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
2025cites this paper