菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-26
📄 Abstract - CIAR: Interval-based Collaborative Decoding for Image Generation Acceleration

Auto-regressive (AR) models have recently made notable progress in image generation, achieving performance comparable to diffusion-based approaches. However, their computational intensity and sequential nature impede on-device deployment, causing disruptive latency. We address this via a cloud-device collaboration framework \textbf{CIAR}, which utilizes on-device self-verification to handle two key properties of visual synthesis: \textit{the vast token vocabulary} required for high-fidelity images and \textit{inherent spatial redundancy} which leads to extreme predictability in homogeneous regions, while object boundaries exhibit high uncertainty. Uniform verification wastes resources on such redundant tokens. Our solution centers on an on-device token uncertainty quantifier, which adopts continuous probability intervals to accelerate processing and make it feasible for large visual vocabularies instead of conventional discrete solution sets. Additionally, we incorporate a Interval-enhanced decoding module to further speed up decoding while maintaining visual fidelity and semantic consistency via a distribution alignment training strategy. Extensive experiments demonstrate that CIAR achieves a 2.18x speed-up and reduces cloud requests by 70\%, while preserving image quality compared to existing methods.

顶级标签: model training systems computer vision
详细标签: autoregressive models image generation inference acceleration cloud-device collaboration uncertainty quantification 或 搜索:

CIAR:基于区间的协同解码用于图像生成加速 / CIAR: Interval-based Collaborative Decoding for Image Generation Acceleration


1️⃣ 一句话总结

这篇论文提出了一种名为CIAR的云-端协同框架,通过在设备端使用连续概率区间来量化图像生成过程中不同区域的不确定性,从而大幅加速自回归模型的图像生成速度,同时减少对云端计算的依赖并保持图像质量。

源自 arXiv: 2603.25463