RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning

📄 Abstract - RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning

As reinforcement learning (RL) deployments expand into safety-critical domains, existing evaluation methods fail to systematically identify hazards arising from the black-box nature of neural network enabled policies and distributional shift between training and deployment. This paper introduces Reinforcement Learning System-Theoretic Process Analysis (RL-STPA), a framework that adapts conventional STPA's systematic hazard analysis to address RL's unique challenges through three key contributions: hierarchical subtask decomposition using both temporal phase analysis and domain expertise to capture emergent behaviors, coverage-guided perturbation testing that explores the sensitivity of state-action spaces, and iterative checkpoints that feed identified hazards back into training through reward shaping and curriculum design. We demonstrate RL-STPA in the safety-critical test case of autonomous drone navigation and landing, revealing potential loss scenarios that can be missed by standard RL evaluations. The proposed framework provides practitioners with a toolkit for systematic hazard analysis, quantitative metrics for safety coverage assessment, and actionable guidelines for establishing operational safety bounds. While RL-STPA cannot provide formal guarantees for arbitrary neural policies, it offers a practical methodology for systematically evaluating and improving RL safety and robustness in safety-critical applications where exhaustive verification methods remain intractable.

RL-STPA：面向安全关键强化学习的系统理论危险分析适配方法 / RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning

1️⃣ 一句话总结

这篇论文提出了一种名为RL-STPA的新框架，它通过结合分层任务分解、覆盖引导的扰动测试和迭代检查点等方法，系统性地识别和应对安全关键领域（如自主无人机导航）中强化学习模型因‘黑箱’特性和数据分布变化可能引发的潜在危险，为实际应用提供了一套实用的安全评估与改进工具。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要