菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-31
📄 Abstract - Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Agentic crafting requires LLMs to operate in real-world environments over multiple turns by taking actions, observing outcomes, and iteratively refining artifacts. Despite its importance, the open-source community lacks a principled, end-to-end ecosystem to streamline agent development. We introduce the Agentic Learning Ecosystem (ALE), a foundational infrastructure that optimizes the production pipeline for agent LLMs. ALE consists of three components: ROLL, a post-training framework for weight optimization; ROCK, a sandbox environment manager for trajectory generation; and iFlow CLI, an agent framework for efficient context engineering. We release ROME (ROME is Obviously an Agentic Model), an open-source agent grounded by ALE and trained on over one million trajectories. Our approach includes data composition protocols for synthesizing complex behaviors and a novel policy optimization algorithm, Interaction-based Policy Alignment (IPA), which assigns credit over semantic interaction chunks rather than individual tokens to improve long-horizon training stability. Empirically, we evaluate ROME within a structured setting and introduce Terminal Bench Pro, a benchmark with improved scale and contamination control. ROME demonstrates strong performance across benchmarks like SWE-bench Verified and Terminal Bench, proving the effectiveness of the ALE infrastructure.

顶级标签: llm agents systems
详细标签: agentic learning policy optimization training ecosystem interaction-based alignment benchmark evaluation 或 搜索:

任其流动:在开放智能体学习生态系统中构建摇滚乐与ROME模型 / Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem


1️⃣ 一句话总结

这篇论文提出了一个名为ALE的开放智能体学习生态系统,并基于此训练出名为ROME的开源智能体模型,通过整合数据合成、新型训练算法和评估基准,解决了当前智能体开发缺乏标准化、端到端基础设施的难题。

源自 arXiv: 2512.24873