保留、揭示与扩展:基于区域感知的忠实4D视频编辑方法 / Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning
1️⃣ 一句话总结
本文提出一种名为PREX的4D视频编辑方法,通过将视频空间划分为需要保留、揭示和扩展三种区域,并分别给予不同的处理策略,从而在保持已有画面内容不变的同时,准确补全被遮挡或镜头外的部分,解决了现有方法常见的画面模糊、重影和内容漂移等问题。
Existing 4D-driven video diffusion models primarily target plausible generation, but faithful 4D editing requires preserving source-observed regions while synthesizing disoccluded or out-of-view content. We identify Evidence-Role Mismatch: reliable source-backed evidence, unreliable rendered cues, and unsupported regions are entangled in a single conditioning signal, causing preservation drift, ghosting, and unstable extrapolation. We propose PREX (Preserve, Reveal, Expand), a region-aware framework that decomposes the target spatiotemporal volume into Preserve, Reveal, and Expand roles according to observation support and scene extent. PREX builds observation-backed appearance cues with calibrated confidence and injects them into a frozen video diffusion backbone through a region-aware adapter, trained with proxy tasks without requiring paired edited videos. We further introduce PREBench, a diagnostic benchmark with curated edits, region-role masks, and human-aligned metrics that complement global video-quality and 4D-control evaluations. Experiments show that PREX reduces region-structured failures while maintaining strong visual quality and 4D edit control capability. Project Page: this https URL
保留、揭示与扩展:基于区域感知的忠实4D视频编辑方法 / Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning
本文提出一种名为PREX的4D视频编辑方法,通过将视频空间划分为需要保留、揭示和扩展三种区域,并分别给予不同的处理策略,从而在保持已有画面内容不变的同时,准确补全被遮挡或镜头外的部分,解决了现有方法常见的画面模糊、重影和内容漂移等问题。
源自 arXiv: 2605.20961