菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-20
📄 Abstract - Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

Existing 4D-driven video diffusion models primarily target plausible generation, but faithful 4D editing requires preserving source-observed regions while synthesizing disoccluded or out-of-view content. We identify Evidence-Role Mismatch: reliable source-backed evidence, unreliable rendered cues, and unsupported regions are entangled in a single conditioning signal, causing preservation drift, ghosting, and unstable extrapolation. We propose PREX (Preserve, Reveal, Expand), a region-aware framework that decomposes the target spatiotemporal volume into Preserve, Reveal, and Expand roles according to observation support and scene extent. PREX builds observation-backed appearance cues with calibrated confidence and injects them into a frozen video diffusion backbone through a region-aware adapter, trained with proxy tasks without requiring paired edited videos. We further introduce PREBench, a diagnostic benchmark with curated edits, region-role masks, and human-aligned metrics that complement global video-quality and 4D-control evaluations. Experiments show that PREX reduces region-structured failures while maintaining strong visual quality and 4D edit control capability. Project Page: this https URL

顶级标签: computer vision video model training
详细标签: 4d video editing faithful editing region-aware disocclusion benchmark 或 搜索:

保留、揭示与扩展:基于区域感知的忠实4D视频编辑方法 / Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning


1️⃣ 一句话总结

本文提出一种名为PREX的4D视频编辑方法,通过将视频空间划分为需要保留、揭示和扩展三种区域,并分别给予不同的处理策略,从而在保持已有画面内容不变的同时,准确补全被遮挡或镜头外的部分,解决了现有方法常见的画面模糊、重影和内容漂移等问题。

源自 arXiv: 2605.20961