菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-15
📄 Abstract - Olmo 3

We introduce Olmo 3, a family of state-of-the-art, fully-open language models at the 7B and 32B parameter scales. Olmo 3 model construction targets long-context reasoning, function calling, coding, instruction following, general chat, and knowledge recall. This release includes the entire model flow, i.e., the full lifecycle of the family of models, including every stage, checkpoint, data point, and dependency used to build it. Our flagship model, Olmo 3 Think 32B, is the strongest fully-open thinking model released to-date.

顶级标签: llm model training systems
详细标签: open-source llm long-context reasoning function calling model lifecycle language model 或 搜索:

Olmo 3 / Olmo 3


1️⃣ 一句话总结

这篇论文介绍了名为Olmo 3的系列开源大语言模型,包含70亿和320亿参数两个版本,特别擅长处理长文本推理、代码生成和指令跟随等任务,并完全公开了从数据到训练的所有细节,其中最强的320亿参数模型是目前性能最好的开源推理模型。


源自 arXiv: 2512.13961