菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-24
📄 Abstract - HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

High-resolution video generation, while crucial for digital media and film, is computationally bottlenecked by the quadratic complexity of diffusion models, making practical inference infeasible. To address this, we introduce HiStream, an efficient autoregressive framework that systematically reduces redundancy across three axes: i) Spatial Compression: denoising at low resolution before refining at high resolution with cached features; ii) Temporal Compression: a chunk-by-chunk strategy with a fixed-size anchor cache, ensuring stable inference speed; and iii) Timestep Compression: applying fewer denoising steps to subsequent, cache-conditioned chunks. On 1080p benchmarks, our primary HiStream model (i+ii) achieves state-of-the-art visual quality while demonstrating up to 76.2x faster denoising compared to the Wan2.1 baseline and negligible quality loss. Our faster variant, HiStream+, applies all three optimizations (i+ii+iii), achieving a 107.5x acceleration over the baseline, offering a compelling trade-off between speed and quality, thereby making high-resolution video generation both practical and scalable.

顶级标签: video generation aigc model training
详细标签: high-resolution video computational efficiency diffusion models latent diffusion real-time generation 或 搜索:

HiStream:一种高效的高分辨率视频生成框架 / HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming


1️⃣ 一句话总结

本文提出了HiStream框架,通过双分辨率缓存、锚点引导滑动窗口和非对称去噪等核心创新,在保持高视觉质量的同时,显著降低了高分辨率视频生成的计算成本和延迟,实现了高达107.5倍的加速。


2️⃣ 论文创新点

1. 双分辨率缓存

2. 锚点引导滑动窗口

3. 非对称去噪策略

4. HiStream+ 加速变体


3️⃣ 主要结果与价值

结果亮点

实际价值


4️⃣ 术语表

源自 arXiv: 2512.21338