📄
Abstract - AILS-NTUA at SemEval-2026 Task 12: Graph-Based Retrieval and Reflective Prompting for Abductive Event Reasoning
We present a winning three-stage system for SemEval 2026 Task~12: Abductive Event Reasoning that combines graph-based retrieval, LLM-driven abductive reasoning with prompt design optimized through reflective prompt evolution, and post-hoc consistency enforcement; our system ranks first on the evaluation-phase leaderboard with an accuracy score of 0.95. Cross-model error analysis across 14 models (7~families) reveals three shared inductive biases: causal chain incompleteness, proximate cause preference, and salience bias, whose cross-family convergence (51\% cause-count reduction) indicates systematic rather than model-specific failure modes in multi-label causal reasoning.
AILS-NTUA在SemEval-2026任务12中的方案:基于图检索与反思性提示优化的溯因事件推理 /
AILS-NTUA at SemEval-2026 Task 12: Graph-Based Retrieval and Reflective Prompting for Abductive Event Reasoning
1️⃣ 一句话总结
这篇论文介绍了一个在溯因事件推理竞赛中夺冠的三阶段系统,它通过图检索、基于大语言模型的推理与反思性提示优化相结合,取得了最高准确率,并揭示了当前多标签因果推理模型中普遍存在的三种系统性偏差。