菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-09
📄 Abstract - Evolution Strategy-Based Calibration for Low-Bit Quantization of Speech Models

Quantization has become essential for the efficient deployment of speech processing systems. Although widely studied, most existing quantization methods were developed for vision and NLP architectures, while the specific challenges of audio signals remain largely overlooked. In particular, we show that audio activations can exhibit large calibration ranges, leading to significant information loss when standard calibration techniques are applied. To address this, we propose ESC, an Evolution Strategy-based Calibration method that formulates activation scaling as an optimization problem and solves it using a two-step local-global scheme driven by an evolution strategy. ESC enables unaltered performance under full INT8 quantization and is the first calibration method to achieve near-lossless performance for full INT4 quantization across multiple speech tasks. Integrating ESC with PTQ methods further reduces performance loss, achieving a 1% relative accuracy degradation on the AST model.

顶级标签: audio model training machine learning
详细标签: quantization speech processing evolution strategy calibration low-bit precision 或 搜索:

基于进化策略的语音模型低比特量化校准方法 / Evolution Strategy-Based Calibration for Low-Bit Quantization of Speech Models


1️⃣ 一句话总结

本文提出了一种名为ESC的新方法,它利用进化策略来优化语音模型量化过程中的激活值缩放问题,从而在极低的INT4精度下,首次实现了跨多个语音任务的近乎无损性能,解决了现有量化技术因音频信号特性而导致的严重信息丢失难题。

源自 arXiv: 2603.08173