持续不确定性学习 / Continual uncertainty learning
1️⃣ 一句话总结
这篇论文提出了一种新的课程式持续学习框架,通过将复杂的多不确定性控制问题分解为一系列顺序学习任务,并结合模型控制器来加速学习,最终成功设计出能抵抗结构非线性和动态变化的汽车动力系统振动控制器,并实现了从仿真到现实的顺利迁移。
Robust control of mechanical systems with multiple uncertainties remains a fundamental challenge, particularly when nonlinear dynamics and operating-condition variations are intricately intertwined. While deep reinforcement learning (DRL) combined with domain randomization has shown promise in mitigating the sim-to-real gap, simultaneously handling all sources of uncertainty often leads to sub-optimal policies and poor learning efficiency. This study formulates a new curriculum-based continual learning framework for robust control problems involving nonlinear dynamical systems in which multiple sources of uncertainty are simultaneously superimposed. The key idea is to decompose a complex control problem with multiple uncertainties into a sequence of continual learning tasks, in which strategies for handling each uncertainty are acquired sequentially. The original system is extended into a finite set of plants whose dynamic uncertainties are gradually expanded and diversified as learning progresses. The policy is stably updated across the entire plant sets associated with tasks defined by different uncertainty configurations without catastrophic forgetting. To ensure learning efficiency, we jointly incorporate a model-based controller (MBC), which guarantees a shared baseline performance across the plant sets, into the learning process to accelerate the convergence. This residual learning scheme facilitates task-specific optimization of the DRL agent for each uncertainty, thereby enhancing sample efficiency. As a practical industrial application, this study applies the proposed method to designing an active vibration controller for automotive powertrains. We verified that the resulting controller is robust against structural nonlinearities and dynamic variations, realizing successful sim-to-real transfer.
持续不确定性学习 / Continual uncertainty learning
这篇论文提出了一种新的课程式持续学习框架,通过将复杂的多不确定性控制问题分解为一系列顺序学习任务,并结合模型控制器来加速学习,最终成功设计出能抵抗结构非线性和动态变化的汽车动力系统振动控制器,并实现了从仿真到现实的顺利迁移。
源自 arXiv: 2602.17174