保持源视频真实感:面向电影级质量的高保真人脸交换 / Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
1️⃣ 一句话总结
这篇论文提出了一种名为LivingSwap的新方法,它通过利用源视频的视觉特征和关键帧引导,首次实现了在长视频中高质量、高保真且时间连贯的人脸替换,显著提升了电影制作中的真实感和效率。
Video face swapping is crucial in film and entertainment production, where achieving high fidelity and temporal consistency over long and complex video sequences remains a significant challenge. Inspired by recent advances in reference-guided image editing, we explore whether rich visual attributes from source videos can be similarly leveraged to enhance both fidelity and temporal coherence in video face swapping. Building on this insight, this work presents LivingSwap, the first video reference guided face swapping model. Our approach employs keyframes as conditioning signals to inject the target identity, enabling flexible and controllable editing. By combining keyframe conditioning with video reference guidance, the model performs temporal stitching to ensure stable identity preservation and high-fidelity reconstruction across long video sequences. To address the scarcity of data for reference-guided training, we construct a paired face-swapping dataset, Face2Face, and further reverse the data pairs to ensure reliable ground-truth supervision. Extensive experiments demonstrate that our method achieves state-of-the-art results, seamlessly integrating the target identity with the source video's expressions, lighting, and motion, while significantly reducing manual effort in production workflows. Project webpage: this https URL
保持源视频真实感:面向电影级质量的高保真人脸交换 / Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
这篇论文提出了一种名为LivingSwap的新方法,它通过利用源视频的视觉特征和关键帧引导,首次实现了在长视频中高质量、高保真且时间连贯的人脸替换,显著提升了电影制作中的真实感和效率。
源自 arXiv: 2512.07951