DreamStereo:面向高清视频的实时立体图像修复 / DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos
1️⃣ 一句话总结
这篇论文提出了一种名为DreamStereo的新方法,通过创新的梯度感知视差扭曲和稀疏感知的修复技术,解决了立体视频修复中数据稀缺和计算冗余两大难题,首次实现了在单个A100 GPU上以25帧/秒的速度实时处理高清立体视频。
Stereo video inpainting, which aims to fill the occluded regions of warped videos with visually coherent content while maintaining temporal consistency, remains a challenging open problem. The regions to be filled are scattered along object boundaries and occupy only a small fraction of each frame, leading to two key challenges. First, existing approaches perform poorly on such tasks due to the scarcity of high-quality stereo inpainting datasets, which limits their ability to learn effective inpainting priors. Second, these methods apply equal processing to all regions of the frame, even though most pixels require no modification, resulting in substantial redundant computation. To address these issues, we introduce three interconnected components. We first propose Gradient-Aware Parallax Warping (GAPW), which leverages backward warping and the gradient of the coordinate mapping function to obtain continuous edges and smooth occlusion regions. Then, a Parallax-Based Dual Projection (PBDP) strategy is introduced, which incorporates GAPW to produce geometrically consistent stereo inpainting pairs and accurate occlusion masks without requiring stereo videos. Finally, we present Sparsity-Aware Stereo Inpainting (SASI), which reduces over 70% of redundant tokens, achieving a 10.7x speedup during diffusion inference and delivering results comparable to its full-computation counterpart, enabling real-time processing of HD (768 x 1280) videos at 25 FPS on a single A100 GPU.
DreamStereo:面向高清视频的实时立体图像修复 / DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos
这篇论文提出了一种名为DreamStereo的新方法,通过创新的梯度感知视差扭曲和稀疏感知的修复技术,解决了立体视频修复中数据稀缺和计算冗余两大难题,首次实现了在单个A100 GPU上以25帧/秒的速度实时处理高清立体视频。
源自 arXiv: 2604.12270