菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-09
📄 Abstract - Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense

The rapid evolution of GUI-enabled agents has rendered traditional CAPTCHAs obsolete. While previous benchmarks like OpenCaptchaWorld established a baseline for evaluating multimodal agents, recent advancements in reasoning-heavy models, such as Gemini3-Pro-High and GPT-5.2-Xhigh have effectively collapsed this security barrier, achieving pass rates as high as 90% on complex logic puzzles like "Bingo". In response, we introduce Next-Gen CAPTCHAs, a scalable defense framework designed to secure the next-generation web against the advanced agents. Unlike static datasets, our benchmark is built upon a robust data generation pipeline, allowing for large-scale and easily scalable evaluations, notably, for backend-supported types, our system is capable of generating effectively unbounded CAPTCHA instances. We exploit the persistent human-agent "Cognitive Gap" in interactive perception, memory, decision-making, and action. By engineering dynamic tasks that require adaptive intuition rather than granular planning, we re-establish a robust distinction between biological users and artificial agents, offering a scalable and diverse defense mechanism for the agentic era.

顶级标签: agents benchmark systems
详细标签: captcha security gui agents cognitive gap evaluation framework 或 搜索:

下一代验证码:利用认知鸿沟构建可扩展且多样化的图形界面智能体防御 / Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense


1️⃣ 一句话总结

这篇论文提出了一种新的验证码防御框架,它通过设计需要人类直觉而非精确规划的动态交互任务,利用人与AI在认知上的根本差异,来有效区分真实用户和高级智能体,从而为网络提供可大规模扩展的安全防护。

源自 arXiv: 2602.09012