语义篡改定位 / Semantic Manipulation Localization
1️⃣ 一句话总结
这篇论文提出了一个名为‘语义篡改定位’的新任务和对应的TRACE框架,专门用于检测图像中那些不明显但会改变图像含义的细微编辑,而不是依赖传统的篡改痕迹检测,从而在复杂的语义编辑场景中实现更准确、更完整的定位。
Image Manipulation Localization (IML) aims to identify edited regions in an image. However, with the increasing use of modern image editing and generative models, many manipulations no longer exhibit obvious low-level artifacts. Instead, they often involve subtle but meaning-altering edits to an object's attributes, state, or relationships while remaining highly consistent with the surrounding content. This makes conventional IML methods less effective because they mainly rely on artifact detection rather than semantic sensitivity. To address this issue, we introduce Semantic Manipulation Localization (SML), a new task that focuses on localizing subtle semantic edits that significantly change image interpretation. We further construct a dedicated fine-grained benchmark for SML using a semantics-driven manipulation pipeline with pixel-level annotations. Based on this task, we propose TRACE (Targeted Reasoning of Attributed Cognitive Edits), an end-to-end framework that models semantic sensitivity through three progressively coupled components: semantic anchoring, semantic perturbation sensing, and semantic-constrained reasoning. Specifically, TRACE first identifies semantically meaningful regions that support image understanding, then injects perturbation-sensitive frequency cues to capture subtle edits under strong visual consistency, and finally verifies candidate regions through joint reasoning over semantic content and semantic scope. Extensive experiments show that TRACE consistently outperforms existing IML methods on our benchmark and produces more complete, compact, and semantically coherent localization results. These results demonstrate the necessity of moving beyond artifact-based localization and provide a new direction for image forensics in complex semantic editing scenarios.
语义篡改定位 / Semantic Manipulation Localization
这篇论文提出了一个名为‘语义篡改定位’的新任务和对应的TRACE框架,专门用于检测图像中那些不明显但会改变图像含义的细微编辑,而不是依赖传统的篡改痕迹检测,从而在复杂的语义编辑场景中实现更准确、更完整的定位。
源自 arXiv: 2604.10132