SIGMark:一种用于视频扩散模型的、支持盲提取的可扩展生成中水印技术 / SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion
1️⃣ 一句话总结
这篇论文提出了一种名为SIGMark的新方法,它能在AI生成视频的过程中直接嵌入看不见的水印,并且无需存储大量密钥就能高效地检测出来,同时还能有效抵抗视频在时间和空间上的修改,解决了现有技术成本高、不抗干扰的问题。
Artificial Intelligence Generated Content (AIGC), particularly video generation with diffusion models, has been advanced rapidly. Invisible watermarking is a key technology for protecting AI-generated videos and tracing harmful content, and thus plays a crucial role in AI safety. Beyond post-processing watermarks which inevitably degrade video quality, recent studies have proposed distortion-free in-generation watermarking for video diffusion models. However, existing in-generation approaches are non-blind: they require maintaining all the message-key pairs and performing template-based matching during extraction, which incurs prohibitive computational costs at scale. Moreover, when applied to modern video diffusion models with causal 3D Variational Autoencoders (VAEs), their robustness against temporal disturbance becomes extremely weak. To overcome these challenges, we propose SIGMark, a Scalable In-Generation watermarking framework with blind extraction for video diffusion. To achieve blind-extraction, we propose to generate watermarked initial noise using a Global set of Frame-wise PseudoRandom Coding keys (GF-PRC), reducing the cost of storing large-scale information while preserving noise distribution and diversity for distortion-free watermarking. To enhance robustness, we further design a Segment Group-Ordering module (SGO) tailored to causal 3D VAEs, ensuring robust watermark inversion during extraction under temporal disturbance. Comprehensive experiments on modern diffusion models show that SIGMark achieves very high bit-accuracy during extraction under both temporal and spatial disturbances with minimal overhead, demonstrating its scalability and robustness. Our project is available at this https URL.
SIGMark:一种用于视频扩散模型的、支持盲提取的可扩展生成中水印技术 / SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion
这篇论文提出了一种名为SIGMark的新方法,它能在AI生成视频的过程中直接嵌入看不见的水印,并且无需存储大量密钥就能高效地检测出来,同时还能有效抵抗视频在时间和空间上的修改,解决了现有技术成本高、不抗干扰的问题。
源自 arXiv: 2603.02882