Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding

Published 2026 in arXiv.org

ABSTRACT

Large Language Models (LLMs) often hallucinate, generating content inconsistent with the input. Retrieval-Augmented Generation (RAG) and Reinforcement Learning with Human Feedback (RLHF) can mitigate hallucinations but require resource-intensive retrieval or large-scale fine-tuning. Decoding-based methods are lighter yet lack explicit hallucination control. To address this, we present Token-Guard, a token-level hallucination control method based on self-checking decoding. Token-Guard performs internal verification at each reasoning step to detect hallucinated tokens before they propagate. Candidate fragments are further evaluated in a latent space with explicit hallucination risk scoring, while iterative pruning and regeneration dynamically correct detected errors. Experiments on HALU datasets show Token-Guard substantially reduces hallucinations and improves generation accuracy, offering a scalable, modular solution for reliable LLM outputs. Our code is publicly available.

PUBLICATION RECORD

Publication year
2026
Venue
arXiv.org
Publication date
2026-01-29
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2601.21969 arXiv 2601.21969
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

HyperGraphRAG: Retrieval-Augmented Generation via Hypergraph-Structured Knowledge Representation
2025cited by this paper
ϕ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
2025cited by this paper
Reflection-Window Decoding: Text Generation with Selective Refinement
2025cited by this paper
KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search
2025cited by this paper
Automating RLHF-Based Hallucination Tracking
2025cited by this paper
Multi-Layered Framework for LLM Hallucination Mitigation in High-Stakes Applications: A Tutorial
2025cited by this paper
Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs
2025cited by this paper
Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning
2025cited by this paper
Token-Level Uncertainty Estimation for Large Language Model Reasoning
2025cited by this paper
Lynx: An Open Source Hallucination Evaluation Model
2024cited by this paper
Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases
2024cited by this paper
Non-myopic Generation of Language Models for Reasoning and Planning
2024cited by this paper
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking
2024cited by this paper
From data to decisions: enhancing financial forecasts with LSTM for AI token prices
2024cited by this paper
Token-level Direct Preference Optimization
2024cited by this paper
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
2024cited by this paper
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
2023cited by this paper
Self-Refine: Iterative Refinement with Self-Feedback
2023cited by this paper
Self-Evaluation Guided Beam Search for Reasoning
2023influential reference
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
2023influential reference
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
2023cited by this paper
How Language Model Hallucinations Can Snowball
2023cited by this paper
Large Language Models
2023cited by this paper
Detecting and Preventing Hallucinations in Large Vision Language Models
2023cited by this paper
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
2023cited by this paper
Towards Mitigating Hallucination in Large Language Models via Self-Reflection
2023cited by this paper
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
2023cited by this paper
Branch-Solve-Merge Improves Large Language Model Evaluation and Generation
2023cited by this paper
FinanceBench: A New Benchmark for Financial Question Answering
2023cited by this paper
Towards Mitigating LLM Hallucination via Self Reflection
2023cited by this paper
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models
2023influential reference
Chain of Thought Prompting Elicits Reasoning in Large Language Models
2022cited by this paper
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
2022cited by this paper
Retrieval Augmentation Reduces Hallucination in Conversation
2021cited by this paper
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
2021cited by this paper
COVID-QA: A Question Answering Dataset for COVID-19
2020cited by this paper
Learning to summarize from human feedback
2020cited by this paper
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
2020influential reference
PubMedQA: A Dataset for Biomedical Research Question Answering
2019cited by this paper
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
2019cited by this paper
Deep Reinforcement Learning from Human Preferences
2017cited by this paper
Paper
1977cited by this paper
Detecting LLM Hallucinations Using Monte Carlo Simulations on Token Probabilities
year unknowncited by this paper

CITED BY

No citing papers are available for this paper.