菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-16
📄 Abstract - Spherical Leech Quantization for Visual Tokenization and Generation

Non-parametric quantization has received much attention due to its efficiency on parameters and scalability to a large codebook. In this paper, we present a unified formulation of different non-parametric quantization methods through the lens of lattice coding. The geometry of lattice codes explains the necessity of auxiliary loss terms when training auto-encoders with certain existing lookup-free quantization variants such as BSQ. As a step forward, we explore a few possible candidates, including random lattices, generalized Fibonacci lattices, and densest sphere packing lattices. Among all, we find the Leech lattice-based quantization method, which is dubbed as Spherical Leech Quantization ($\Lambda_{24}$-SQ), leads to both a simplified training recipe and an improved reconstruction-compression tradeoff thanks to its high symmetry and even distribution on the hypersphere. In image tokenization and compression tasks, this quantization approach achieves better reconstruction quality across all metrics than BSQ, the best prior art, while consuming slightly fewer bits. The improvement also extends to state-of-the-art auto-regressive image generation frameworks.

顶级标签: computer vision model training machine learning
详细标签: vector quantization lattice coding image tokenization auto-encoder leeck lattice 或 搜索:

用于视觉标记化与生成的球形Leech量化方法 / Spherical Leech Quantization for Visual Tokenization and Generation


1️⃣ 一句话总结

这篇论文提出了一种基于高对称性Leech晶格的图像量化新方法,它通过简化训练流程并优化压缩与重建的平衡,在图像压缩和生成任务中取得了比现有最佳技术更好的效果。


源自 arXiv: 2512.14697