The rapid adoption of large language models (LLMs) such as GPT-4 and Claude 3.5 underscores the need to distinguish LLM-generated text from human-written content, both to curb misinformation and to prevent misuse in education. One promising approach is watermarking, which embeds subtle statistical signals into LLM-generated text to enable reliable identification. In this paper, we first generalize the likelihood-based LLM detection method by introducing a flexible weighted formulation and then adapt this approach to the inverse transform sampling method. Moving beyond watermark detection, we extend this adaptive strategy to the more challenging problem of segmenting a given text into watermarked and non-watermarked substrings. Unlike a previous approach that relies on accurate estimates of next-token probabilities, which are highly sensitive to prompt estimation, our framework removes the need for precise prompt estimation. Extensive numerical experiments demonstrate that the proposed methodology is both effective and robust in accurately segmenting texts containing a mixture of watermarked and non-watermarked content.
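The weighted likelihood-based detection the abstract describes can be illustrated with a minimal sketch. The sketch below is a hypothetical construction, not the paper's actual statistic: it assumes a Gumbel-style watermark in which the generator and detector share a keyed pseudorandom uniform u_t per token, so that under the no-watermark null the per-token scores -log(1 - u_t) behave like i.i.d. Exp(1) draws. The function names, the length-4 context window, and the hash-based pseudorandom source are all illustrative assumptions.

```python
import hashlib
import math
from typing import Sequence

def pseudo_uniform(key: str, context: Sequence[int], token: int) -> float:
    """Keyed hash of (context, token) mapped to a uniform in [0, 1).
    Stands in for the pseudorandom source shared by generator and detector
    (an illustrative choice, not the paper's construction)."""
    h = hashlib.sha256(f"{key}|{list(context)}|{token}".encode()).hexdigest()
    return (int(h, 16) % (2**53)) / 2**53

def weighted_score(key: str, tokens: Sequence[int],
                   weights: Sequence[float]) -> float:
    """Weighted sum of per-token scores -log(1 - u_t).
    Under the no-watermark null each score is roughly Exp(1), so the sum
    concentrates near sum(weights); watermarked text inflates it, and a
    threshold on this statistic yields a detection test."""
    total = 0.0
    for t, (tok, w) in enumerate(zip(tokens, weights)):
        # Seed the pseudorandom uniform with a short preceding context window.
        u = pseudo_uniform(key, tokens[max(0, t - 4):t], tok)
        u = min(u, 1.0 - 1e-12)  # guard against log(0)
        total += w * -math.log(1.0 - u)
    return total
```

In this toy setup, nonuniform weights are what make the formulation "flexible": tokens believed more informative for detection can be up-weighted, and in a segmentation setting the statistic can be evaluated over sliding substrings to locate watermarked regions.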
Adaptive Testing for Segmenting Watermarked Texts From Language Models
Xingchi Li, Xiaochi Liu, Guanxun Li
Published 2025 in Stat
PUBLICATION RECORD
- Publication year: 2025
- Venue: Stat
- Publication date: 2025-11-10
- Fields of study: Mathematics, Computer Science
- Source: Semantic Scholar