面向AI生成图像检测的频率感知语义融合与门控注入方法 / Frequency-Aware Semantic Fusion with Gated Injection for AI-generated Image Detection
1️⃣ 一句话总结
本文提出了一种名为FGINet的检测方法,通过设计频带遮蔽编码器减少对特定生成器特征的依赖,并利用分层门控注入机制将频域线索逐步融入视觉大模型,从而有效识别AI生成的图像,在不同生成模型下均能保持高准确率和强泛化能力。
AI-generated images are becoming increasingly realistic and diverse, posing significant challenges for generalizable detection. While Vision Foundation Models (VFMs) provide rich semantic representations and frequency-based methods capture complementary artifact cues, existing approaches that combine these modalities still suffer from limited generalization, with notable performance degradation on unseen generative models. We attribute this limitation to two key factors: frequency shortcut bias toward easily distinguishable cues associated with specific generators and cross-domain representation conflict between high-level semantics and low-level frequency patterns. To address these issues, we propose a Frequency-aware Gated Injection Network (FGINet) to improve generalization. Specifically, we design a Band-Masked Frequency Encoder (BMFE) that applies cross-band masking in the frequency domain to reduce reliance on generator-specific patterns and encourage more diverse and generalizable representations. We further introduce a Layer-wise Gated Frequency Injection (LGFI) mechanism to progressively inject frequency cues into the VFM backbone with adaptive gating, aligning with its hierarchical abstraction and alleviating representation conflict. Moreover, we propose a Hyperspherical Compactness Learning (HCL) framework with a cosine margin objective to learn compact and well-separated representations. Extensive experiments demonstrate that FGINet achieves state-of-the-art performance and strong generalization across multiple challenging datasets.
面向AI生成图像检测的频率感知语义融合与门控注入方法 / Frequency-Aware Semantic Fusion with Gated Injection for AI-generated Image Detection
本文提出了一种名为FGINet的检测方法,通过设计频带遮蔽编码器减少对特定生成器特征的依赖,并利用分层门控注入机制将频域线索逐步融入视觉大模型,从而有效识别AI生成的图像,在不同生成模型下均能保持高准确率和强泛化能力。
源自 arXiv: 2604.27875