菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Graphical User Interface (GUI) Agents, benefiting from recent advances in multimodal large language models (MLLM), have achieved significant development. However, due to the frequent updates of GUI applications, adapting to new tasks without forgetting old tasks in GUI continual learning remains an open problem. In this work, we reveal that while Supervised Fine-Tuning (SFT) facilitates fast adaptation, it often triggers knowledge overwriting, whereas Reinforcement Learning (RL) demonstrates an inherent resilience that shields prior interaction logic from erasure. Based on this insight, we propose a \textbf{C}ontinual \textbf{G}UI \textbf{L}earning (CGL) framework that dynamically balances adaptation efficiency and skill retention by enhancing the synergy between SFT and RL. Specifically, we introduce an SFT proportion adjustment mechanism guided by policy entropy to dynamically control the weight allocation between the SFT and RL training phases. To resolve explicit gradient interference, we further develop a specialized gradient surgery strategy. By projecting exploratory SFT gradients onto GRPO-based anchor gradients, our method explicitly clips the components of SFT gradients that conflict with GRPO. On top of that, we establish an AndroidControl-CL benchmark, which divides GUI applications into distinct task groups to effectively simulate and evaluate the performance of continual GUI learning. Experimental results demonstrate the effectiveness of our proposed CGL framework across continual learning scenarios. The benchmark, code, and model will be made publicly available.

顶级标签: agents reinforcement learning model training
详细标签: continual learning gui agents reinforcement fine-tuning multimodal llm gradient surgery 或 搜索:

CGL:通过强化微调推进持续GUI学习 / CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning


1️⃣ 一句话总结

这篇论文提出了一个名为CGL的新框架,通过巧妙地结合监督微调和强化学习,并引入动态调整机制与梯度处理策略,有效解决了图形用户界面智能体在持续学习新任务时容易遗忘旧知识的问题。

源自 arXiv: 2603.02951