Interaction Dynamics as a Reward Signal for LLMs

Published 2025 in arXiv.org

ABSTRACT

The alignment of Large Language Models (LLMs) for multi-turn conversations typically relies on reward signals derived from the content of the text. This approach, however, overlooks a rich, complementary source of signal: the dynamics of the interaction itself. This paper introduces TRACE (Trajectory-based Reward for Agent Collaboration Estimation), a novel reward signal derived from the geometric properties of a dialogue's embedding trajectory, a concept we term 'conversational geometry'. Our central finding is that a reward model trained only on these structural signals achieves a pairwise accuracy (68.20%) comparable to a powerful LLM baseline that analyzes the full transcript (70.04%). Furthermore, a hybrid model combining interaction dynamics with textual analysis achieves the highest performance (80.17%), demonstrating their complementary nature. This work provides strong evidence that for interactive settings, how an agent communicates is as powerful a predictor of success as what it says, offering a new, privacy-preserving framework that not only aligns agents but also serves as a diagnostic tool for understanding the distinct interaction patterns that drive successful collaboration.
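The abstract names two ideas that a small example can make concrete: geometric features of a dialogue's embedding trajectory, and pairwise accuracy as the evaluation metric. The sketch below is illustrative only and is not the TRACE implementation. The abstract does not list the paper's actual features, so the ones computed here (path length, straightness, mean cosine between consecutive steps) are assumptions, as are the function names and the toy 2-D "embeddings". Pairwise accuracy is taken in its usual sense: the fraction of (preferred, rejected) pairs in which the preferred item receives the higher score.

```python
import math


def _sub(a, b):
    """Element-wise difference a - b of two equal-length vectors."""
    return [x - y for x, y in zip(a, b)]


def _norm(v):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(x * x for x in v))


def trajectory_features(embeddings):
    """Hypothetical geometric features of a per-turn embedding trajectory.

    `embeddings` is a list of equal-length vectors, one per dialogue turn.
    These features are assumptions for illustration, not the paper's set.
    """
    # Displacement between each pair of consecutive turns.
    steps = [_sub(b, a) for a, b in zip(embeddings, embeddings[1:])]
    lengths = [_norm(s) for s in steps]
    path_length = sum(lengths)

    # Straight-line distance from the first turn to the last.
    net = _norm(_sub(embeddings[-1], embeddings[0])) if steps else 0.0
    straightness = net / path_length if path_length > 0 else 0.0

    # Cosine between consecutive displacements (1 = same direction).
    cosines = []
    for s, t, ls, lt in zip(steps, steps[1:], lengths, lengths[1:]):
        if ls > 0 and lt > 0:
            cosines.append(sum(x * y for x, y in zip(s, t)) / (ls * lt))
    mean_turn_cosine = sum(cosines) / len(cosines) if cosines else 0.0

    return {
        "path_length": path_length,
        "straightness": straightness,
        "mean_turn_cosine": mean_turn_cosine,
    }


def pairwise_accuracy(score_fn, pairs):
    """Fraction of (preferred, rejected) pairs ranked correctly by score_fn."""
    correct = sum(1 for good, bad in pairs if score_fn(good) > score_fn(bad))
    return correct / len(pairs)


# Toy 2-D trajectories standing in for real sentence embeddings.
straight = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
zigzag = [[0.0, 0.0], [1.0, 0.0], [0.0, 0.0]]


def toy_score(trajectory):
    """Hypothetical reward: straighter trajectories score higher."""
    return trajectory_features(trajectory)["straightness"]


# One pair labelled consistently with the toy score, one labelled against it.
accuracy = pairwise_accuracy(toy_score, [(straight, zigzag), (zigzag, straight)])
```

Note that the features depend only on the embedding vectors and not on the transcript text.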

PUBLICATION RECORD

Publication year
2025
Venue
arXiv.org
Publication date
2025-11-11
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2511.08394 arXiv 2511.08394
Source metadata
Semantic Scholar

REFERENCES

Writing as a testbed for open ended agents
2025, cited by this paper
Opportunities and Challenges of LLMs in Education: An NLP Perspective
2025, cited by this paper
Evaluating Human-AI Collaboration: A Review and Methodological Framework
2024, cited by this paper
Towards Next-Generation Intelligent Assistants Leveraging LLM Techniques
2023, cited by this paper
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
2023, cited by this paper
CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities
2022, cited by this paper
Co-Writing Screenplays and Theatre Scripts with Language Models: Evaluation by Industry Professionals
2022, cited by this paper

CITED BY

MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games
2026, cites this paper
TopoCurate: Modeling Interaction Topology for Tool-Use Agent Training
2026, cites this paper