AutoWebWorld:通过有限状态机合成无限可验证的网页环境 / AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
1️⃣ 一句话总结
这篇论文提出了一个名为AutoWebWorld的新方法,它通过将网页环境建模为有限状态机并自动生成交互式网站,从而低成本、大批量地创造出状态清晰、每一步都可验证的训练数据,显著提升了自主网页操作AI在真实网站上的表现。
The performance of autonomous Web GUI agents heavily relies on the quality and quantity of their training data. However, a fundamental bottleneck persists: collecting interaction trajectories from real-world websites is expensive and difficult to verify. The underlying state transitions are hidden, leading to reliance on inconsistent and costly external verifiers to evaluate step-level correctness. To address this, we propose AutoWebWorld, a novel framework for synthesizing controllable and verifiable web environments by modeling them as Finite State Machines (FSMs) and use coding agents to translate FSMs into interactive websites. Unlike real websites, where state transitions are implicit, AutoWebWorld explicitly defines all states, actions, and transition rules. This enables programmatic verification: action correctness is checked against predefined rules, and task success is confirmed by reaching a goal state in the FSM graph. AutoWebWorld enables a fully automated search-and-verify pipeline, generating over 11,663 verified trajectories from 29 diverse web environments at only $0.04 per trajectory. Training on this synthetic data significantly boosts real-world performance. Our 7B Web GUI agent outperforms all baselines within 15 steps on WebVoyager. Furthermore, we observe a clear scaling law: as the synthetic data volume increases, performance on WebVoyager and Online-Mind2Web consistently improves.
AutoWebWorld:通过有限状态机合成无限可验证的网页环境 / AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines
这篇论文提出了一个名为AutoWebWorld的新方法,它通过将网页环境建模为有限状态机并自动生成交互式网站,从而低成本、大批量地创造出状态清晰、每一步都可验证的训练数据,显著提升了自主网页操作AI在真实网站上的表现。
源自 arXiv: 2602.14296