菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising

We present TVF (Time-Varying Filtering), a low-latency speech enhancement model with 1 million parameters. Combining the interpretability of Digital Signal Processing (DSP) with the adaptability of deep learning, TVF bridges the gap between traditional filtering and modern neural speech modeling. The model utilizes a lightweight neural network backbone to predict the coefficients of a differentiable 35-band IIR filter cascade in real time, allowing it to adapt dynamically to non-stationary noise. Unlike ``black-box'' deep learning approaches, TVF offers a completely interpretable processing chain, where spectral modifications are explicit and adjustable. We demonstrate the efficacy of this approach on a speech denoising task using the Valentini-Botinhao dataset and compare the results to a static DDSP approach and a fully deep-learning-based solution, showing that TVF achieves effective adaptation to changing noise conditions.

顶级标签: audio machine learning model training
详细标签: speech denoising iir filtering differentiable dsp real-time processing interpretable models 或 搜索:

用于实时语音降噪的可微分时变IIR滤波 / Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising


1️⃣ 一句话总结

这篇论文提出了一种名为TVF的低延迟、可解释的语音增强模型,它通过一个轻量级神经网络实时预测滤波器系数,将传统信号处理的透明性与深度学习的适应性相结合,从而有效应对动态变化的噪声环境。

源自 arXiv: 2603.02794