迈向任意时间有效的统计水印 / Towards Anytime-Valid Statistical Watermarking
1️⃣ 一句话总结
这篇论文提出了一种名为‘锚定E-水印’的新方法,它首次将最优采样与任意时间有效的推理结合起来,使得在大语言模型生成的文本中嵌入和检测水印时,可以在任意时刻停止检测并保持统计有效性,从而将检测所需的平均文本量减少了13-15%。
The proliferation of Large Language Models (LLMs) necessitates efficient mechanisms to distinguish machine-generated content from human text. While statistical watermarking has emerged as a promising solution, existing methods suffer from two critical limitations: the lack of a principled approach for selecting sampling distributions and the reliance on fixed-horizon hypothesis testing, which precludes valid early stopping. In this paper, we bridge this gap by developing the first e-value-based watermarking framework, Anchored E-Watermarking, that unifies optimal sampling with anytime-valid inference. Unlike traditional approaches where optional stopping invalidates Type-I error guarantees, our framework enables valid, anytime-inference by constructing a test supermartingale for the detection process. By leveraging an anchor distribution to approximate the target model, we characterize the optimal e-value with respect to the worst-case log-growth rate and derive the optimal expected stopping time. Our theoretical claims are substantiated by simulations and evaluations on established benchmarks, showing that our framework can significantly enhance sample efficiency, reducing the average token budget required for detection by 13-15% relative to state-of-the-art baselines.
迈向任意时间有效的统计水印 / Towards Anytime-Valid Statistical Watermarking
这篇论文提出了一种名为‘锚定E-水印’的新方法,它首次将最优采样与任意时间有效的推理结合起来,使得在大语言模型生成的文本中嵌入和检测水印时,可以在任意时刻停止检测并保持统计有效性,从而将检测所需的平均文本量减少了13-15%。
源自 arXiv: 2602.17608