Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

📄 Abstract - Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

Shielded reinforcement learning is typically presented as a runtime safety mechanism that compiles temporal-logic specifications into automata restricting an agent's actions. We argue this is the wrong product. The same automata-theoretic machinery -- specification compilation, product game construction, attractor computation, and winning-region extraction -- is better read as a design-time analytical instrument whose outputs are structural insights about a system rather than runtime constraints on a deployed agent. We instantiate this through a constrained two-player safety game for network defense. The two specifications are enforced asymmetrically: the defender specification defines the unsafe region of the game, whereas the attacker specification restricts the adversary's legal actions during attractor computation. Solving the game yields a defensibility verdict -- a formal certificate that a topology-specification pair is or is not defensible -- with the associated winning region and shield. Beyond the binary verdict, we derive topology-level metrics from the attractor structure and combine them with post-convergence behavior from shield-constrained adversarial multi-agent reinforcement learning. Together these form a defensibility fingerprint capturing both a network's formal safety properties and its operational behavior under adaptive play. A what-if analysis shows that formal defensibility and operational effectiveness capture distinct aspects of security: small architectural changes can produce large shifts in operational outcomes while leaving formal safety margins nearly unchanged. Shield synthesis is thus most valuable not as a deployment mechanism for safe agents, but as a framework for answering architectural questions about whether, where, and how a system can be defended. The defensibility verdict is the output, not the safe policy.

超越运行时强制：将盾牌综合视为对抗网络的可防御性分析 / Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

1️⃣ 一句话总结

本文提出将强化学习中的安全盾牌机制从运行时的约束工具，重新理解为一种设计阶段的分析方法：通过构建双人安全博弈并提取可防御性指纹，来评估网络拓扑是否真的能被防御，从而为系统的安全设计提供结构性洞见，而非仅仅训练一个安全运行的智能体。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要