菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - Revealing Algorithmic Deductive Circuits for Logical Reasoning

Recent studies have shown that Large Language Models (LLMs) can achieve strong reasoning performance by incorporating functional symbolic representations that abstractly describe graph traversal algorithms and step-by-step reasoning in few-shot learning settings. However, it remains unclear how LLMs genuinely understand the abstract meaning of each reasoning step and the overall algorithm from only a limited number of demonstrations. This work aims to localize the attention heads responsible for individual reasoning steps and characterize the types of information transferred among them. We first align constituent reasoning steps with their corresponding token logits under a symbolic-aided Chain-of-Thought (CoT) prompting framework. Our analysis shows that token positions that steer the reasoning process are associated with low confidence scores caused by constraints on satisfying reasoning behavior patterns in demonstrations. We then adopt causal mediation analysis techniques to identify the attention heads responsible for these patterns. In addition, our findings indicate that LLMs retrieve factual and rule-based information for individual sub-reasoning tasks through specialized attention heads (approximately 3% total heads), whereas higher layers predominantly facilitate information integration and the emergence of global reasoning strategies (e.g., graph traversal algorithms) that coordinate multiple intermediate reasoning steps to solve the overall task.

顶级标签: llm natural language processing
详细标签: chain-of-thought reasoning mechanisms attention head analysis causal mediation logical reasoning 或 搜索:

揭示逻辑推理中的算法推演回路 / Revealing Algorithmic Deductive Circuits for Logical Reasoning


1️⃣ 一句话总结

本文发现大型语言模型在少数示例下进行逻辑推理时,少数专用注意力头(约占总数的3%)负责提取事实与规则信息,而更高层则协调这些中间步骤以形成整体推理策略,如图遍历算法。

源自 arXiv: 2605.27824