arXiv submission date: 2026-01-14
📄 Abstract - PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

While GUI agents have shown strong performance under explicit and completion instructions, real-world deployment requires aligning with users' more complex implicit intents. In this work, we highlight Hierarchical Implicit Intent Alignment for Personalized GUI Agent (PersonalAlign), a new agent task that requires agents to leverage long-term user records as persistent context to resolve omitted preferences in vague instructions and anticipate latent routines by user state for proactive assistance. To facilitate this study, we introduce AndroidIntent, a benchmark designed to evaluate agents' ability to resolve vague instructions and provide proactive suggestions by reasoning over long-term user records. We annotated 775 user-specific preferences and 215 routines from 20k long-term records across different users for evaluation. Furthermore, we introduce Hierarchical Intent Memory Agent (HIM-Agent), which maintains a continuously updating personal memory and hierarchically organizes user preferences and routines for personalization. Finally, we evaluate a range of GUI agents on AndroidIntent, including GPT-5, Qwen3-VL, and UI-TARS. The results show that HIM-Agent significantly improves execution and proactive performance, by 15.7% and 7.3% respectively.

Top-level tags: agents, systems, benchmark
Detailed tags: gui agents, personalization, implicit intent, long-term memory, user behavior modeling

PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records


1️⃣ One-Sentence Summary

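This paper proposes a new task, PersonalAlign, and a benchmark, AndroidIntent, aimed at enabling GUI agents to use a user's long-term interaction records to understand the hidden preferences and habits behind vague instructions and to proactively offer personalized assistance. The hierarchical memory agent proposed in the paper significantly outperforms other existing models on this task.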

Source: arXiv: 2601.09636