保留伪造痕迹:原生尺度下的AI生成视频检测 / Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
1️⃣ 一句话总结
这篇论文针对现有AI生成视频检测方法会因固定尺寸预处理而丢失关键伪造痕迹的问题,提出了一个包含海量视频的新数据集和一个能在视频原始分辨率下直接分析、从而有效保留高频伪造特征的新型检测框架,显著提升了检测准确率。
The rapid advancement of video generation models has enabled the creation of highly realistic synthetic media, raising significant societal concerns regarding the spread of misinformation. However, current detection methods suffer from critical limitations. They rely on preprocessing operations like fixed-resolution resizing and cropping. These operations not only discard subtle, high-frequency forgery traces but also cause spatial distortion and significant information loss. Furthermore, existing methods are often trained and evaluated on outdated datasets that fail to capture the sophistication of modern generative models. To address these challenges, we introduce a comprehensive dataset and a novel detection framework. First, we curate a large-scale dataset of over 140K videos from 15 state-of-the-art open-source and commercial generators, along with Magic Videos benchmark designed specifically for evaluating ultra-realistic synthetic content. In addition, we propose a novel detection framework built on the Qwen2.5-VL Vision Transformer, which operates natively at variable spatial resolutions and temporal durations. This native-scale approach effectively preserves the high-frequency artifacts and spatiotemporal inconsistencies typically lost during conventional preprocessing. Extensive experiments demonstrate that our method achieves superior performance across multiple benchmarks, underscoring the critical importance of native-scale processing and establishing a robust new baseline for AI-generated video detection.
保留伪造痕迹:原生尺度下的AI生成视频检测 / Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
这篇论文针对现有AI生成视频检测方法会因固定尺寸预处理而丢失关键伪造痕迹的问题,提出了一个包含海量视频的新数据集和一个能在视频原始分辨率下直接分析、从而有效保留高频伪造特征的新型检测框架,显著提升了检测准确率。
源自 arXiv: 2604.04634