菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-14
📄 Abstract - HintMR: Eliciting Stronger Mathematical Reasoning in Small Language Models

Small language models (SLMs) often struggle with complex mathematical reasoning due to limited capacity to maintain long chains of intermediate steps and to recover from early errors. We address this challenge by introducing a hint-assisted reasoning framework that incrementally guides SLMs through multi-step mathematical problem solving. Our approach decomposes solutions into sequential reasoning steps and provides context-aware hints, where hints are generated by a separate SLM trained via distillation from a strong large language model. While the hint-generating SLM alone is not capable of solving the problems, its collaboration with a reasoning SLM enables effective guidance, forming a cooperative two-model system for reasoning. Each hint is generated conditionally on the problem statement and the accumulated reasoning history, providing stepwise, localized guidance without revealing full solutions. This reduces error propagation and allows the reasoning model to focus on manageable subproblems. Experiments across diverse mathematical benchmarks and models demonstrate that hint assistance consistently improves reasoning accuracy for SLMs, yielding substantial gains over standard prompting while preserving model efficiency. These results highlight that structured collaboration between SLMs-via hint generation and reasoning-offers an effective and lightweight mechanism for enhancing mathematical reasoning.

顶级标签: llm model training model evaluation
详细标签: mathematical reasoning small language models hint-assisted reasoning knowledge distillation multi-step reasoning 或 搜索:

HintMR:在小语言模型中激发更强的数学推理能力 / HintMR: Eliciting Stronger Mathematical Reasoning in Small Language Models


1️⃣ 一句话总结

这篇论文提出了一种通过提示辅助的协作框架,让两个能力有限的小语言模型一个负责生成解题提示、另一个负责推理,从而有效提升了小模型在复杂数学问题上的解题准确率,且不增加计算负担。

源自 arXiv: 2604.12229