菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-23
📄 Abstract - Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs

Existing visual privacy benchmarks largely treat privacy as a binary property, labeling images as private or non-private based on visible sensitive content. We argue that privacy is fundamentally compositional. Attributes that are benign in isolation may combine to produce severe privacy violations. We introduce the Compositional Privacy Risk Taxonomy (CPRT), a regulation-aware framework that organizes visual attributes according to standalone identifiability and compositional harm potential. CPRT defines four graded severity levels and is paired with an interpretable scoring function that assigns continuous privacy severity scores. We further construct a taxonomy-aligned dataset of 6.7K images and derive ground-truth compositional risk scores. By evaluating frontier and open-weight VLMs we find that frontier models align well with compositional severity when provided structured guidance, but systematically underestimate composition-driven risks. Smaller models struggle to internalize graded privacy reasoning. To bridge this gap, we introduce a deployable 8B supervised fine-tuned (SFT) model that closely matches frontier-level performance on compositional privacy assessment.

顶级标签: multi-modal model evaluation computer vision
详细标签: visual privacy compositional risk vlm evaluation privacy taxonomy severity assessment 或 搜索:

重新思考视觉隐私:一个基于视觉语言模型的组合隐私风险框架与严重性评估 / Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs


1️⃣ 一句话总结

这篇论文提出了一个组合式隐私风险框架,认为隐私风险不是非黑即白的,而是由多种视觉元素组合决定的,并开发了一个能评估隐私风险严重程度的新模型。

源自 arXiv: 2603.21573