菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-18
📄 Abstract - When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection

Learning with noisy labels (LNL) is typically benchmarked by closed-set classification accuracy, yet deployment often requires classifiers to reject out-of-distribution (OOD) inputs. We present a learner-agnostic ACC-OOD benchmark that freezes LNL checkpoints and evaluates them with standardized near-/far-OOD routing and post-hoc scores across synthetic and real label noise. The benchmark reveals a recurring failure mode: high closed-set accuracy does not ensure OOD reliability, because low-confidence, misclassified in-distribution samples can overlap the score and feature regions occupied by OOD inputs under noisy training. We term this pathology uncertainty collapse. This structural overlap can make high-accuracy LNL methods lose separability at the ID-error/OOD interface under standard OOD scores. As an intervention, we study Virtual Margin Regularization (VMR), a lightweight repair probe demonstrated mainly with PSSCL that synthesizes boundary virtual outliers on trusted ID batches and widens the energy margin. VMR partially reduces the collapse-induced far-OOD failure without replacing the host objective or sacrificing closed-set accuracy in the tested settings. These results support LNL benchmarks that co-report closed-set generalization, open-world reliability, and structural overlap diagnostics.

顶级标签: machine learning model evaluation
详细标签: noisy label learning out-of-distribution detection uncertainty collapse benchmark regularization 或 搜索:

当准确率不够时:噪声标签学习与分布外检测之间的不确定性崩塌 / When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection


1️⃣ 一句话总结

本文发现,在噪声标签学习任务中,高分类准确率并不能保证模型在检测分布外样本时的可靠性,因为被错误分类的低置信度样本与分布外样本在特征空间上会发生重叠,导致“不确定性崩塌”;作者提出了一个修复方法——虚拟边缘正则化,能在不牺牲准确率的前提下部分缓解这一问题。

源自 arXiv: 2605.17795