菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-09
📄 Abstract - Do Language Models Know Theo Has a Wife? Investigating the Proviso Problem

We investigate how language models handle the proviso problem, an unresolved issue in pragmatics where presuppositions in conditional sentences diverge between theoretical and human interpretations. We reformulate this phenomenon as a Natural Language Inference task and introduce a diagnostic dataset designed to probe presupposition projection in conditionals. We evaluate RoBERTa, DeBERTa, LLaMA, and Gemma using explainability analyses. The results show that models broadly align with human judgments but rely on shallow pattern matching rather than semantic or pragmatic reasoning. Our work provides the first computational evaluation framework for the proviso problem and highlights the need for diagnostic, multi-method approaches to assess pragmatic competence and context-dependent meaning in language models.

顶级标签: llm natural language processing model evaluation
详细标签: pragmatics presupposition natural language inference diagnostic dataset explainability 或 搜索:

语言模型知道Theo有妻子吗?探究附带条件问题 / Do Language Models Know Theo Has a Wife? Investigating the Proviso Problem


1️⃣ 一句话总结

这篇论文通过将语用学中一个关于条件句预设的未解难题转化为自然语言推理任务,并构建诊断数据集来测试主流语言模型,发现这些模型虽然总体上能做出与人类相似的判断,但其依据是浅层的模式匹配而非深层的语义或语用推理。

源自 arXiv: 2603.08358