菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-04
📄 Abstract - AILS-NTUA at SemEval-2026 Task 12: Graph-Based Retrieval and Reflective Prompting for Abductive Event Reasoning

We present a winning three-stage system for SemEval 2026 Task~12: Abductive Event Reasoning that combines graph-based retrieval, LLM-driven abductive reasoning with prompt design optimized through reflective prompt evolution, and post-hoc consistency enforcement; our system ranks first on the evaluation-phase leaderboard with an accuracy score of 0.95. Cross-model error analysis across 14 models (7~families) reveals three shared inductive biases: causal chain incompleteness, proximate cause preference, and salience bias, whose cross-family convergence (51\% cause-count reduction) indicates systematic rather than model-specific failure modes in multi-label causal reasoning.

顶级标签: llm natural language processing agents
详细标签: abductive reasoning graph retrieval prompt engineering causal reasoning error analysis 或 搜索:

AILS-NTUA在SemEval-2026任务12中的方案:基于图检索与反思性提示优化的溯因事件推理 / AILS-NTUA at SemEval-2026 Task 12: Graph-Based Retrieval and Reflective Prompting for Abductive Event Reasoning


1️⃣ 一句话总结

这篇论文介绍了一个在溯因事件推理竞赛中夺冠的三阶段系统,它通过图检索、基于大语言模型的推理与反思性提示优化相结合,取得了最高准确率,并揭示了当前多标签因果推理模型中普遍存在的三种系统性偏差。

源自 arXiv: 2603.04319