迈向基于多智能体推理的上下文感知图像匿名化 / Towards Context-Aware Image Anonymization with Multi-Agent Reasoning
1️⃣ 一句话总结
这篇论文提出了一种名为CAIAMAR的新型智能体框架,它能够根据图像中物体所处的空间环境(如私人或公共区域)来智能判断并隐藏人脸等个人身份信息,在有效保护隐私的同时,比现有方法更好地保持了图像质量,并且完全在本地运行以满足数据安全法规要求。
Street-level imagery contains personally identifiable information (PII), some of which is context-dependent. Existing anonymization methods either over-process images or miss subtle identifiers, while API-based solutions compromise data sovereignty. We present an agentic framework CAIAMAR (\underline{C}ontext-\underline{A}ware \underline{I}mage \underline{A}nonymization with \underline{M}ulti-\underline{A}gent \underline{R}easoning) for context-aware PII segmentation with diffusion-based anonymization, combining pre-defined processing for high-confidence cases with multi-agent reasoning for indirect identifiers. Three specialized agents coordinate via round-robin speaker selection in a Plan-Do-Check-Act (PDCA) cycle, enabling large vision-language models to classify PII based on spatial context (private vs. public property) rather than rigid category rules. The agents implement spatially-filtered coarse-to-fine detection where a scout-and-zoom strategy identifies candidates, open-vocabulary segmentation processes localized crops, and $IoU$-based deduplication ($30\%$ threshold) prevents redundant processing. Modal-specific diffusion guidance with appearance decorrelation substantially reduces re-identification (Re-ID) risks. On CUHK03-NP, our method reduces person Re-ID risk by $73\%$ ($R1$: $16.9\%$ vs. $62.4\%$ baseline). For image quality preservation on CityScapes, we achieve KID: $0.001$, and FID: $9.1$, significantly outperforming existing anonymization. The agentic workflow detects non-direct PII instances across object categories, and downstream semantic segmentation is preserved. Operating entirely on-premise with open-source models, the framework generates human-interpretable audit trails supporting EU's GDPR transparency requirements while flagging failed cases for human review.
迈向基于多智能体推理的上下文感知图像匿名化 / Towards Context-Aware Image Anonymization with Multi-Agent Reasoning
这篇论文提出了一种名为CAIAMAR的新型智能体框架,它能够根据图像中物体所处的空间环境(如私人或公共区域)来智能判断并隐藏人脸等个人身份信息,在有效保护隐私的同时,比现有方法更好地保持了图像质量,并且完全在本地运行以满足数据安全法规要求。
源自 arXiv: 2603.27817