探测语言模型中的阅读时间 / Probing for Reading Times
1️⃣ 一句话总结
本研究通过分析五种语言的眼动数据,发现语言模型早期层级的表征能有效预测人类阅读中的早期注视行为(如首次注视时长),而传统的概率预测指标(如惊讶度)则在预测整体阅读时长上表现更佳,揭示了模型深度与人类阅读时间阶段之间存在功能上的对齐关系。
Probing has shown that language model representations encode rich linguistic information, but it remains unclear whether they also capture cognitive signals about human processing. In this work, we probe language model representations for human reading times. Using regularized linear regression on two eye-tracking corpora spanning five languages (English, Greek, Hebrew, Russian, and Turkish), we compare the representations from every model layer against scalar predictors -- surprisal, information value, and logit-lens surprisal. We find that the representations from early layers outperform surprisal in predicting early-pass measures such as first fixation and gaze duration. The concentration of predictive power in the early layers suggests that human-like processing signatures are captured by low-level structural or lexical representations, pointing to a functional alignment between model depth and the temporal stages of human reading. In contrast, for late-pass measures such as total reading time, scalar surprisal remains superior, despite its being a much more compressed representation. We also observe performance gains when using both surprisal and early-layer representations. Overall, we find that the best-performing predictor varies strongly depending on the language and eye-tracking measure.
探测语言模型中的阅读时间 / Probing for Reading Times
本研究通过分析五种语言的眼动数据,发现语言模型早期层级的表征能有效预测人类阅读中的早期注视行为(如首次注视时长),而传统的概率预测指标(如惊讶度)则在预测整体阅读时长上表现更佳,揭示了模型深度与人类阅读时间阶段之间存在功能上的对齐关系。
源自 arXiv: 2604.18712