Seed-Prover 1.5:通过经验学习掌握本科水平定理证明 / Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience
1️⃣ 一句话总结
这项研究提出了一个名为Seed-Prover 1.5的定理证明模型,它通过让AI模型在形式化数学环境中不断试错和积累经验来学习,从而高效地解决了从本科到博士级别的数学难题,其性能超越了现有方法。
Large language models have recently made significant progress to generate rigorous mathematical proofs. In contrast, utilizing LLMs for theorem proving in formal languages (such as Lean) remains challenging and computationally expensive, particularly when addressing problems at the undergraduate level and beyond. In this work, we present \textbf{Seed-Prover 1.5}, a formal theorem-proving model trained via large-scale agentic reinforcement learning, alongside an efficient test-time scaling (TTS) workflow. Through extensive interactions with Lean and other tools, the model continuously accumulates experience during the RL process, substantially enhancing the capability and efficiency of formal theorem proving. Furthermore, leveraging recent advancements in natural language proving, our TTS workflow efficiently bridges the gap between natural and formal languages. Compared to state-of-the-art methods, Seed-Prover 1.5 achieves superior performance with a smaller compute budget. It solves \textbf{88\% of PutnamBench} (undergraduate-level), \textbf{80\% of Fate-H} (graduate-level), and \textbf{33\% of Fate-X} (PhD-level) problems. Notably, using our system, we solved \textbf{11 out of 12 problems} from Putnam 2025 within 9 hours. Our findings suggest that scaling learning from experience, driven by high-quality formal feedback, holds immense potential for the future of formal mathematical reasoning.
Seed-Prover 1.5:通过经验学习掌握本科水平定理证明 / Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience
这项研究提出了一个名为Seed-Prover 1.5的定理证明模型,它通过让AI模型在形式化数学环境中不断试错和积累经验来学习,从而高效地解决了从本科到博士级别的数学难题,其性能超越了现有方法。
源自 arXiv: 2512.17260