菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-11
📄 Abstract - SCRAPL: Scattering Transform with Random Paths for Machine Learning

The Euclidean distance between wavelet scattering transform coefficients (known as paths) provides informative gradients for perceptual quality assessment of deep inverse problems in computer vision, speech, and audio processing. However, these transforms are computationally expensive when employed as differentiable loss functions for stochastic gradient descent due to their numerous paths, which significantly limits their use in neural network training. Against this problem, we propose "Scattering transform with Random Paths for machine Learning" (SCRAPL): a stochastic optimization scheme for efficient evaluation of multivariable scattering transforms. We implement SCRAPL for the joint time-frequency scattering transform (JTFS) which demodulates spectrotemporal patterns at multiple scales and rates, allowing a fine characterization of intermittent auditory textures. We apply SCRAPL to differentiable digital signal processing (DDSP), specifically, unsupervised sound matching of a granular synthesizer and the Roland TR-808 drum machine. We also propose an initialization heuristic based on importance sampling, which adapts SCRAPL to the perceptual content of the dataset, improving neural network convergence and evaluation performance. We make our code and audio samples available and provide SCRAPL as a Python package.

顶级标签: audio machine learning model training
详细标签: scattering transform stochastic optimization differentiable dsp sound matching perceptual loss 或 搜索:

SCRAPL:用于机器学习的随机路径散射变换 / SCRAPL: Scattering Transform with Random Paths for Machine Learning


1️⃣ 一句话总结

本文提出了一种名为SCRAPL的随机优化方法,通过随机采样路径来大幅降低散射变换的计算成本,使其能高效地作为神经网络的损失函数,从而提升在音频合成等任务中模型训练的效率和效果。

源自 arXiv: 2602.11145