用于智能信息搜索的嵌套浏览器使用学习 / Nested Browser-Use Learning for Agentic Information Seeking
1️⃣ 一句话总结
这篇论文提出了一种名为NestBrowse的新方法,通过设计一个嵌套的浏览器操作框架,让AI智能体能够像人一样直接、高效地操作真实浏览器来获取深层网络信息,从而解决了现有信息搜索工具功能受限、难以处理复杂网页内容的问题。
Information-seeking (IS) agents have achieved strong performance across a range of wide and deep search tasks, yet their tool use remains largely restricted to API-level snippet retrieval and URL-based page fetching, limiting access to the richer information available through real browsing. While full browser interaction could unlock deeper capabilities, its fine-grained control and verbose page content returns introduce substantial complexity for ReAct-style function-calling agents. To bridge this gap, we propose Nested Browser-Use Learning (NestBrowse), which introduces a minimal and complete browser-action framework that decouples interaction control from page exploration through a nested structure. This design simplifies agentic reasoning while enabling effective deep-web information acquisition. Empirical results on challenging deep IS benchmarks demonstrate that NestBrowse offers clear benefits in practice. Further in-depth analyses underscore its efficiency and flexibility.
用于智能信息搜索的嵌套浏览器使用学习 / Nested Browser-Use Learning for Agentic Information Seeking
这篇论文提出了一种名为NestBrowse的新方法,通过设计一个嵌套的浏览器操作框架,让AI智能体能够像人一样直接、高效地操作真实浏览器来获取深层网络信息,从而解决了现有信息搜索工具功能受限、难以处理复杂网页内容的问题。
源自 arXiv: 2512.23647