基于对抗合成场景的机器人安全策略学习 / Learning of Robot Safety Policies via Adversarial Synthetic Scenarios
1️⃣ 一句话总结
本文提出了一种通过红蓝两队对抗游戏生成危险场景的方法,让机器人在模拟的极端情况中不断学习安全策略,以弥补传统随机模拟的不足,为现实世界中的物理AI系统提供可扩展的安全保障。
In this work, we propose an agentic gamification framework for hazard-informed learning of robot safety policies through synthetic scenarios. We model scenario generation as an adversarial game between two agents: a Red Team that explores the space of potential failures by constructing hazardous situations, and a Blue Team that incrementally refines safety policies to prevent them. This iterative process enables efficient discovery of high-risk edge cases that are unlikely to be captured through random simulation or manual enumeration. By combining classical risk modeling with adversarial scenario generation and modern learning paradigms, this work provides a scalable pathway for embedding safety into Physical AI systems operating in complex real-world environments. The paper describes ongoing work. The contribution is a problem formulation and a proposed solution architecture.
基于对抗合成场景的机器人安全策略学习 / Learning of Robot Safety Policies via Adversarial Synthetic Scenarios
本文提出了一种通过红蓝两队对抗游戏生成危险场景的方法,让机器人在模拟的极端情况中不断学习安全策略,以弥补传统随机模拟的不足,为现实世界中的物理AI系统提供可扩展的安全保障。
源自 arXiv: 2606.05952