Towards Native Intelligence: 6G-LLM Trained with Reinforcement Learning from NDT Feedback

Zhuoran Xiao,Tao Tao,Chenhui Ye,Yunbo Hu,Yijia Feng,Tianyu Jiao,Liyu Cai

Published 2026 in Unknown venue

ABSTRACT

Owing to its comprehensive understanding of upper-layer application requirements and the capabilities of practical communication systems, the 6G-LLM (6G domain large language model) offers a promising pathway toward realizing network native intelligence. Serving as the system orchestrator, the 6G-LLM drives a paradigm shift that fundamentally departs from existing rule-based approaches, which primarily rely on modular, experience-driven optimization. By contrast, the 6G-LLM substantially enhances network flexibility and adaptability. Nevertheless, current efforts to construct 6G-LLMs are constrained by their reliance on large-scale, meticulously curated, human-authored corpora, which are impractical to obtain in real-world scenarios. Moreover, purely offline-trained models lack the capacity for continual self-improvement, limiting their ability to adapt to the highly dynamic requirements of wireless communication environments. To overcome these limitations, we propose a novel training paradigm termed RLDTF (Reinforcement Learning from Digital Twin Feedback) for 6G-LLMs. This framework leverages network digital twins to generate reward signals based on orchestration outcomes, while employing reinforcement learning to guide the model toward optimal decision-making dynamically. Furthermore, we introduce a weighted token mechanism to improve output accuracy. Comprehensive experimental results demonstrate that our proposed framework significantly outperforms state-of-the-art baselines in orchestration accuracy and solution optimality.

PUBLICATION RECORD

Publication year
2026
Venue
Unknown venue
Publication date
2026-01-15
Fields of study
Computer Science, Engineering
Identifiers
arXiv 2601.09992
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

AI2MMUM: AI-AI Oriented Multi-Modal Universal Model Leveraging Telecom Domain Large Model
2025cited by this paper
When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment
2024cited by this paper
6G Comprehensive Intelligence: Network Operations and Optimization Based on Large Language Models
2024cited by this paper
TelecomGPT: A Framework to Build Telecom-Specific Large Language Models
2024cited by this paper
LLM Agents as 6G Orchestrator: A Paradigm for Task-Oriented Physical-Layer Automation
2024cited by this paper
Large Language Model Enhanced Multi-Agent Systems for 6G Communications
2023cited by this paper
DeepRx: Fully Convolutional Deep Learning Receiver
2020cited by this paper

CITED BY

6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks
2026cites this paper