FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

📄 Abstract - FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is important for building agents that can continually learn from real-world. Just as interactive environments have often driven progress in agents, advancing live future prediction naturally motivates viewing it as a learning environment. Prior works have explored future prediction from several different parts, but have generally not framed it as a unified learning environment. This task is appealing for learning because it can provide a large number of prediction questions grounded in diverse real-world events, while preventing answer leakage. To leverage the advantages of live future prediction, we present FutureWorld, a live agentic reinforcement learning environment that closes the training loop between prediction, outcome realization, and parameters update. In our environment, we take three open-source base models and train them for consecutive days. The results show that training is effective. Furthermore, we build a daily benchmark based on the environment and evaluate several frontier agents on it to establish performance baselines for current agent systems.

未来世界：一个利用真实世界结果奖励训练预测型智能体的实时环境 / FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

1️⃣ 一句话总结

本文提出了一个名为“未来世界”的实时强化学习环境，让AI智能体可以在真实世界事件发生前进行预测，并根据事后结果自动获得奖励来更新自身参数，从而持续学习，实验证明这种训练方式能有效提升模型性能。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要