基于平滑性的PAC-Bayes界去随机化方法 / Smoothness-Based Derandomization of PAC-Bayes Bounds
1️⃣ 一句话总结
本文提出了一种利用损失函数和预测器平滑性将PAC-Bayes概率上界转化为确定性预测器高概率泛化界的方法,并通过引入基于参数雅可比矩阵和海森矩阵的正则化项来提升神经网络的实际泛化性能,实验表明该方法对批量归一化网络有效。
We study PAC-Bayes derandomization for smooth loss functions. Our goal is to obtain generalization bounds that hold with high probability for deterministic predictors by exploiting smoothness properties of both the loss and the predictor class. We show that passing from the Gibbs predictor to the deterministic predictor at the posterior mean has a precise cost, given by the generalization gap of the Jensen gap class. We control this class through its Rademacher complexity, leading to bounds for deterministic predictors that involve flatness quantities expressed in terms of parameter Jacobians and Hessians of the score map. The framework applies to both bounded and unbounded smooth loss functions, and we specialize the results to linear predictors and smooth neural networks. Finally, the Jacobian and Hessian quantities appearing in the theory motivate a practical regularizer. For BatchNorm networks, we compute this regularizer with respect to effective BatchNorm weights obtained by folding the BatchNorm transformation into the adjacent affine weights. Experiments on CIFAR-10 illustrate the behavior of this regularizer under different batch sizes.
基于平滑性的PAC-Bayes界去随机化方法 / Smoothness-Based Derandomization of PAC-Bayes Bounds
本文提出了一种利用损失函数和预测器平滑性将PAC-Bayes概率上界转化为确定性预测器高概率泛化界的方法,并通过引入基于参数雅可比矩阵和海森矩阵的正则化项来提升神经网络的实际泛化性能,实验表明该方法对批量归一化网络有效。
源自 arXiv: 2606.19105