多样性中的可检测性:一种用于单次训练中隐私审计的金丝雀样本优化方法 / Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run
1️⃣ 一句话总结
本文提出了一种通过优化“金丝雀”样本(用于测试模型是否记住特定数据的特殊样本)的方法,使其既容易被检测到,彼此之间又不相互干扰,从而在单次模型训练中更高效、更准确地评估机器学习模型的隐私泄露风险。
Privacy auditing aims to empirically assess privacy leakage in machine learning models using membership inference attacks (MIAs), and to derive lower bounds on differential privacy (DP) parameters. Recent one-run auditing methods address the high cost of standard approaches by relying on a single training run with multiple "canary" points whose inclusion or exclusion must be detected by the auditor. In this work, we study the problem of efficiently crafting canaries for one-run privacy auditing. Motivated by recent theoretical insights suggesting that interference between canaries contributes to weaker leakage estimates compared to multi-run methods, we propose to optimize canaries to be both highly detectable and minimally interfering. Our approach combines a greedy initialization based on influence functions with a bilevel optimization procedure that maximizes distinguishability while promoting diversity in embedding space, enabling the use of computationally efficient bilevel algorithms. Experiments show that our method achieves stronger privacy leakage estimates at a lower computational cost than existing canary crafting approaches.
多样性中的可检测性:一种用于单次训练中隐私审计的金丝雀样本优化方法 / Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run
本文提出了一种通过优化“金丝雀”样本(用于测试模型是否记住特定数据的特殊样本)的方法,使其既容易被检测到,彼此之间又不相互干扰,从而在单次模型训练中更高效、更准确地评估机器学习模型的隐私泄露风险。
源自 arXiv: 2605.27292