将适应转化为资产:面向在线视觉语言导航的跨域桥接方法 / Turning Adaptation into Assets: Cross-Domain Bridging for Online Vision-Language Navigation
1️⃣ 一句话总结
本文提出一种名为IDEA的全新测试时自适应框架,通过将在线适应过程中获得的知识转化为可积累和组合的“资产”,并在此基础上构建跨域桥接,有效解决了视觉语言导航机器人在动态环境中因遗忘和负迁移导致性能下降的问题。
Navigating under non-stationary environment shifts poses a critical challenge for a Vision-and-Language Navigation (VLN) agent deployed in the wild. Yet, existing Test-Time Adaptation (TTA) methods for VLN largely treat online adaptation as transient, isolated updates, leading to catastrophic forgetting and negative transfer. To overcome these issues, we propose Inter-Domain BridgE with Historical Assets (IDEA), a novel TTA framework that transforms adaptation into the accumulation and composition of assets. Specifically, IDEA introduces soft prompts optimized via a Fisher-guided weighting scheme to capture the transferable knowledge. These optimized prompts are then augmented with domain coordinates to form a dynamic asset library. Leveraging this library, IDEA constructs a cross-domain bridge by projecting the target domain onto the convex hull of historical knowledge. These designs form a complementary loop: the evolving library underpins bridge construction, while the bridge provides superior initialization to accelerate asset optimization. Extensive experiments across REVERIE, R2R, and R2R-CE benchmarks demonstrate the consistent superiority of IDEA over existing methods, showcasing its ability to enable training-free adaptation via asset sharing.
将适应转化为资产:面向在线视觉语言导航的跨域桥接方法 / Turning Adaptation into Assets: Cross-Domain Bridging for Online Vision-Language Navigation
本文提出一种名为IDEA的全新测试时自适应框架,通过将在线适应过程中获得的知识转化为可积累和组合的“资产”,并在此基础上构建跨域桥接,有效解决了视觉语言导航机器人在动态环境中因遗忘和负迁移导致性能下降的问题。
源自 arXiv: 2605.23257