通过决策模式转移理解泛化 / Understanding Generalization through Decision Pattern Shift
1️⃣ 一句话总结
本文提出一种新视角“决策模式转移”(DPS),通过衡量模型内部特征通道的决策模式在训练与测试之间的稳定性来量化泛化失败,发现DPS大小与泛化差距线性相关,并能将多种性能下降现象(如分布偏移、捷径学习等)统一为连续轨迹,为早期风险检测和故障诊断提供新方法。
Understanding why deep neural networks (DNNs) fail to generalize to unseen samples remains a long-standing challenge. Existing studies mainly examine changes in externally observable factors such as data, representations, or outputs, yet offer limited insight into how a model's internal decision mechanism evolves from training to test. To address this gap, we introduce Decision Pattern Shift (DPS), a new perspective that defines generalization through the stability of internal decision patterns and quantifies failure as their deviation from those learned during training. Specifically, we represent each sample's decision pattern as a GradCAM-based channel-contribution vector, which captures how feature channels collectively support a prediction, and we propose the DPS metric to measure its discrepancy from the class-average pattern. Empirical analyses across multiple datasets and architectures show that, (i) decision patterns form a highly structured, class-consistent space with strong intra-class cohesion and low inter-class confusion, enabling direct analysis of a model's decision logic; (ii) the DPS magnitude correlates linearly with the generalization gap (nearly all Pearson r > 0.8), revealing generalization as a systematic drift in the model's internal decision mechanism; (iii) the DPS spectrum organizes diverse generalization degradation scenarios (covering ideal generalization, in-distribution degradation, domain shift, out-of-distribution, and shortcut learning) into a continuous trajectory, providing a unified explanation of their failure modes. These findings open up new possibilities for early generalization-risk detection, failure-mode diagnosis, and channel-level defect localization.
通过决策模式转移理解泛化 / Understanding Generalization through Decision Pattern Shift
本文提出一种新视角“决策模式转移”(DPS),通过衡量模型内部特征通道的决策模式在训练与测试之间的稳定性来量化泛化失败,发现DPS大小与泛化差距线性相关,并能将多种性能下降现象(如分布偏移、捷径学习等)统一为连续轨迹,为早期风险检测和故障诊断提供新方法。
源自 arXiv: 2605.13148