Sci-Reasoning: A Dataset Decoding AI Innovation Patterns

📄 Abstract - Sci-Reasoning: A Dataset Decoding AI Innovation Patterns

While AI innovation accelerates rapidly, the intellectual process behind breakthroughs -- how researchers identify gaps, synthesize prior work, and generate insights -- remains poorly understood. The lack of structured data on scientific reasoning hinders systematic analysis and development of AI research agents. We introduce Sci-Reasoning, the first dataset capturing the intellectual synthesis behind high-quality AI research. Using community-validated quality signals and an LLM-accelerated, human-verified pipeline, we trace Oral and Spotlight papers across NeurIPS, ICML, and ICLR (2023-2025) to its key predecessors, articulating specific reasoning links in a structured format. Our analysis identifies 15 distinct thinking patterns, with three dominant strategies accounting for 52.7%: Gap-Driven Reframing (24.2%), Cross-Domain Synthesis (18.0%), and Representation Shift (10.5%). The most powerful innovation recipes combine multiple patterns: Gap-Driven Reframing + Representation Shift, Cross-Domain Synthesis + Representation Shift, and Gap-Driven Reframing + Cross-Domain Synthesis. This dataset enables quantitative studies of scientific progress and provides structured reasoning trajectories for training the next generation AI research agents.

Sci-Reasoning：一个解码人工智能创新模式的数据集 / Sci-Reasoning: A Dataset Decoding AI Innovation Patterns

1️⃣ 一句话总结

这篇论文构建了一个名为Sci-Reasoning的数据集，通过追踪顶级AI会议论文与其关键前作之间的推理联系，首次系统地揭示了AI研究背后的核心思维模式，并发现其中三种主要策略（如填补空白和跨领域融合）主导了超过一半的创新。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要