迈向一致的世界模型:基于多令牌预测与潜在语义增强的方法 / Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
1️⃣ 一句话总结
这篇论文提出了一种名为‘潜在语义增强的多令牌预测’新方法,通过将离散的令牌预测与真实隐藏状态轨迹对齐,有效减少了大型语言模型在构建内部世界模型时产生的结构性幻觉,使其表示更准确、更稳健。
Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conventional Next-Token Prediction (NTP) focuses on one-step-ahead supervision, Multi-Token Prediction (MTP) has shown promise in learning more structured representations. In this work, we provide a theoretical perspective analyzing the gradient inductive bias of MTP, supported by empirical evidence, showing that MTP promotes the convergence toward internal belief states by inducing representational contractivity via gradient coupling. However, we reveal that standard MTP often suffers from structural hallucinations, where discrete token supervision encourages illegal shortcuts in latent space that violate environmental constraints. To address this, we propose a novel method Latent Semantic Enhancement MTP (LSE-MTP), which anchors predictions to ground-truth hidden state trajectories. Experiments on synthetic graphs and real-world Manhattan Taxi Ride show that LSE-MTP effectively bridges the gap between discrete tokens and continuous state representations, enhancing representation alignment, reducing structural hallucinations, and improving robustness to perturbations.
迈向一致的世界模型:基于多令牌预测与潜在语义增强的方法 / Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
这篇论文提出了一种名为‘潜在语义增强的多令牌预测’新方法,通过将离散的令牌预测与真实隐藏状态轨迹对齐,有效减少了大型语言模型在构建内部世界模型时产生的结构性幻觉,使其表示更准确、更稳健。
源自 arXiv: 2604.06155