面向自主水面艇的自适应强化学习跨平台控制方法 / Cross-Platform Control for Autonomous Surface Vehicles via Adaptive Reinforcement Learning
1️⃣ 一句话总结
本文提出一种自适应强化学习方法,通过让策略学习一个隐藏的动力学特征表示,实现单一控制策略在无需微调的情况下零样本部署到不同水面艇上,并在真实实验中比非自适应方法降低最高58%的位置跟踪误差。
Autonomous surface vehicles vary widely in hydrodynamic and actuation characteristics, yet most controllers are designed for single-platform deployment. We present an adaptive reinforcement learning approach for trajectory tracking that enables zero-shot cross-platform deployment using a single policy. Since the deployment platform's dynamics are unknown to the policy, we address cross-platform generalization with the standard partial-observability approach of conditioning on interaction history, employing a teacher-student architecture in which a learned module infers a latent representation of the platform dynamics. The policy is trained in simulation under randomized vessel dynamics and is deployed zero-shot to two real-world platforms without any fine-tuning, despite relying on a simple analytical dynamics model rather than a high-fidelity hydrodynamic simulator. In real-world experiments on two different platforms, the adaptive policy outperforms non-adaptive learning-based baselines by up to 58% in position mean absolute error while approaching the tracking accuracy of a platform-specific tuned controller.
面向自主水面艇的自适应强化学习跨平台控制方法 / Cross-Platform Control for Autonomous Surface Vehicles via Adaptive Reinforcement Learning
本文提出一种自适应强化学习方法,通过让策略学习一个隐藏的动力学特征表示,实现单一控制策略在无需微调的情况下零样本部署到不同水面艇上,并在真实实验中比非自适应方法降低最高58%的位置跟踪误差。
源自 arXiv: 2607.02037