Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

📄 Abstract - Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

Existing 4D-driven video diffusion models primarily target plausible generation, but faithful 4D editing requires preserving source-observed regions while synthesizing disoccluded or out-of-view content. We identify Evidence-Role Mismatch: reliable source-backed evidence, unreliable rendered cues, and unsupported regions are entangled in a single conditioning signal, causing preservation drift, ghosting, and unstable extrapolation. We propose PREX (Preserve, Reveal, Expand), a region-aware framework that decomposes the target spatiotemporal volume into Preserve, Reveal, and Expand roles according to observation support and scene extent. PREX builds observation-backed appearance cues with calibrated confidence and injects them into a frozen video diffusion backbone through a region-aware adapter, trained with proxy tasks without requiring paired edited videos. We further introduce PREBench, a diagnostic benchmark with curated edits, region-role masks, and human-aligned metrics that complement global video-quality and 4D-control evaluations. Experiments show that PREX reduces region-structured failures while maintaining strong visual quality and 4D edit control capability. Project Page: this https URL

保留、揭示与扩展：基于区域感知的忠实4D视频编辑方法 / Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

1️⃣ 一句话总结

本文提出一种名为PREX的4D视频编辑方法，通过将视频空间划分为需要保留、揭示和扩展三种区域，并分别给予不同的处理策略，从而在保持已有画面内容不变的同时，准确补全被遮挡或镜头外的部分，解决了现有方法常见的画面模糊、重影和内容漂移等问题。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要