通过后验采样实现保形语言建模 / Conformal Language Modeling via Posterior Sampling
1️⃣ 一句话总结
本文提出一种新方法,通过在生成过程中对语言模型的后验分布进行采样,而不是事后修正输出,从而在保证统计可靠性的同时,显著提升了生成文本的连贯性和实用性,有效减少了大型语言模型的幻觉问题。
Large Language Models remain plagued by hallucinations. Recent work has sought to tame their prevalence using statistical techniques based on conformal prediction, with both theoretical and empirical success. However, these methods operate in a post-hoc fashion, treating the sampling procedure itself as atomic and then surgically altering samples to remove hallucinated claims. This disconnect between filtering and generation can result in samples that are incoherent, inconsistent, or simply unlikely under the model itself. Moreover, post-hoc surgery is unable to shift probability mass towards more useful and helpful responses. To address these issues, we propose to instead sample from approximations to an LLM posterior, where the conditioning event corresponds to a calibrated, high-scoring region. We develop a calibration procedure tailored to the setting of conditional sequential generation that effectively identifies this region and achieves target risk control. Empirically, we apply our method to case studies focused on open-ended biography generation and mathematical problem solving; compared to prior work, we obtain the same statistical guarantees, with higher downstream utility.
通过后验采样实现保形语言建模 / Conformal Language Modeling via Posterior Sampling
本文提出一种新方法,通过在生成过程中对语言模型的后验分布进行采样,而不是事后修正输出,从而在保证统计可靠性的同时,显著提升了生成文本的连贯性和实用性,有效减少了大型语言模型的幻觉问题。
源自 arXiv: 2606.03731