📄
Abstract - Your Neighbors Know: Leveraging Local Neighborhoods for Backdoor Detection in Decentralized Learning
Decentralized learning (DL) is an emerging machine learning paradigm where nodes collaboratively train models without a central server. However, the collaborative nature of DL makes it vulnerable to backdoor attacks, where a model is taught to behave normally on standard inputs while executing hidden, malicious actions when encountering data with specific triggers. Backdoor attacks in DL remain understudied and existing defenses often overlook DL constraints. We introduce Argus, a novel backdoor detection framework native to DL that requires neither a central coordinator nor prior knowledge of the trigger. In Argus, honest nodes locally analyze received model updates to identify potential backdoor triggers. Nodes then collectively share their triggers with their neighbors and use a structural similarity metric to separate true backdoors from false alarms induced by data heterogeneity. A key insight is that false positive triggers exhibit inconsistencies across participants while true positive ones show consistent patterns. Model updates that fail this collaborative test are rejected, and persistently malicious senders are eventually evicted. We provide the first theoretical convergence guarantees for a DL-specific backdoor detection mechanism, showing that filtering out suspicious model updates with high probability preserves a convergence rate comparable to standard DL. We implement and evaluate Argus on three standard datasets and against three state-of-the-art baselines. Across settings, Argus reduces attack success rates by up to 90 points compared to no defense, while preserving model utility within 5 percentage points of an omniscient oracle. Furthermore, the effectiveness of Argus compared to baselines improves as data heterogeneity increases.
你的邻居知道:利用局部邻居进行去中心化学习中的后门检测 /
Your Neighbors Know: Leveraging Local Neighborhoods for Backdoor Detection in Decentralized Learning
1️⃣ 一句话总结
本文提出了一种名为Argus的去中心化学习后门攻击检测框架,它让诚实节点通过分析邻居的模型更新来发现可疑后门模式,并利用结构相似性指标区分真后门和误报,从而在无中央服务器的情况下高效防御攻击,同时保证模型准确率几乎不受影响。