METIS:用于深度探究与解决方案的智能导师引擎 / METIS: Mentoring Engine for Thoughtful Inquiry & Solutions
1️⃣ 一句话总结
这篇论文开发了一个名为METIS的AI研究导师,它通过分阶段指导、文献检索和工具辅助,有效帮助本科生从初步想法完成研究论文,并在多个评估中表现优于主流大模型。
Many students lack access to expert research mentorship. We ask whether an AI mentor can move undergraduates from an idea to a paper. We build METIS, a tool-augmented, stage-aware assistant with literature search, curated guidelines, methodology checks, and memory. We evaluate METIS against GPT-5 and Claude Sonnet 4.5 across six writing stages using LLM-as-a-judge pairwise preferences, student-persona rubrics, short multi-turn tutoring, and evidence/compliance checks. On 90 single-turn prompts, LLM judges preferred METIS to Claude Sonnet 4.5 in 71% and to GPT-5 in 54%. Student scores (clarity/actionability/constraint-fit; 90 prompts x 3 judges) are higher across stages. In multi-turn sessions (five scenarios/agent), METIS yields slightly higher final quality than GPT-5. Gains concentrate in document-grounded stages (D-F), consistent with stage-aware routing and groundings failure modes include premature tool routing, shallow grounding, and occasional stage misclassification.
METIS:用于深度探究与解决方案的智能导师引擎 / METIS: Mentoring Engine for Thoughtful Inquiry & Solutions
这篇论文开发了一个名为METIS的AI研究导师,它通过分阶段指导、文献检索和工具辅助,有效帮助本科生从初步想法完成研究论文,并在多个评估中表现优于主流大模型。
源自 arXiv: 2601.13075