基于学习型安全滤波器与自适应共形推理的安全控制 / Safe Control using Learned Safety Filters and Adaptive Conformal Inference
1️⃣ 一句话总结
这篇论文提出了一种名为ACoFi的新方法,它通过将学习型安全滤波器与自适应统计推理技术相结合,能够动态调整控制系统的安全切换策略,从而在复杂或未知环境中比传统方法更有效地保障系统安全,同时提供可量化的安全性能保证。
Safety filters have been shown to be effective tools to ensure the safety of control systems with unsafe nominal policies. To address scalability challenges in traditional synthesis methods, learning-based approaches have been proposed for designing safety filters for systems with high-dimensional state and control spaces. However, the inevitable errors in the decisions of these models raise concerns about their reliability and the safety guarantees they offer. This paper presents Adaptive Conformal Filtering (ACoFi), a method that combines learned Hamilton-Jacobi reachability-based safety filters with adaptive conformal inference. Under ACoFi, the filter dynamically adjusts its switching criteria based on the observed errors in its predictions of the safety of actions. The range of possible safety values of the nominal policy's output is used to quantify uncertainty in safety assessment. The filter switches from the nominal policy to the learned safe one when that range suggests it might be unsafe. We show that ACoFi guarantees that the rate of incorrectly quantifying uncertainty in the predicted safety of the nominal policy is asymptotically upper bounded by a user-defined parameter. This gives a soft safety guarantee rather than a hard safety guarantee. We evaluate ACoFi in a Dubins car simulation and a Safety Gymnasium environment, empirically demonstrating that it significantly outperforms the baseline method that uses a fixed switching threshold by achieving higher learned safety values and fewer safety violations, especially in out-of-distribution scenarios.
基于学习型安全滤波器与自适应共形推理的安全控制 / Safe Control using Learned Safety Filters and Adaptive Conformal Inference
这篇论文提出了一种名为ACoFi的新方法,它通过将学习型安全滤波器与自适应统计推理技术相结合,能够动态调整控制系统的安全切换策略,从而在复杂或未知环境中比传统方法更有效地保障系统安全,同时提供可量化的安全性能保证。
源自 arXiv: 2604.18482