Stream3D:基于证据记忆的序列化多视角三维生成 / Stream3D: Sequential Multi-View 3D Generation via Evidential Memory
1️⃣ 一句话总结
Stream3D提出了一种无需重新训练的流式生成机制,通过一个紧凑的证据记忆模块,从不断变化的单目视频流中智能选择最关键的帧,使得已有的单视角3D生成器能够稳定、一致地生成连续的三维对象,解决了长序列下内存爆炸和结果跳变的问题。
View-conditioned 3D generators such as SAM 3D, TRELLIS and Hunyuan3D produce high-quality object reconstructions from a single view, but real-world visual observation often arrives as long monocular streams. Naively applying these generators to each streaming frame independently leads to severe temporal inconsistency in the generated results. To address this problem, we propose Stream3D, the first training-free streaming mechanism that turns a frozen view-conditioned 3D generator into a streaming generator with constant cross-chunk memory. Stream3D achieves this by maintaining a compact evidential memory, which selectively caches the most informative historical frames based on a proposed evidence score mechanism. As the stream progresses, the memory dynamically updates to retain a fixed number of informative frames, preventing the memory footprint from growing linearly with sequence length. This also prevents degradation over long sequences and keeps the underlying generator completely unchanged without retraining, architectural modifications, or auxiliary losses. Evaluated on both realistic and synthetic streaming benchmarks, Stream3D outperforms latent-transport baselines, including KV-cache reuse and flow-based feature editing, across both photometric and geometric metrics. More details can be found at: this https URL.
Stream3D:基于证据记忆的序列化多视角三维生成 / Stream3D: Sequential Multi-View 3D Generation via Evidential Memory
Stream3D提出了一种无需重新训练的流式生成机制,通过一个紧凑的证据记忆模块,从不断变化的单目视频流中智能选择最关键的帧,使得已有的单视角3D生成器能够稳定、一致地生成连续的三维对象,解决了长序列下内存爆炸和结果跳变的问题。
源自 arXiv: 2605.21472