菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-19
📄 Abstract - Computational and Statistical Hardness of Calibration Distance

The distance from calibration, introduced by Błasiok, Gopalan, Hu, and Nakkiran (STOC 2023), has recently emerged as a central measure of miscalibration for probabilistic predictors. We study the fundamental problems of computing and estimating this quantity, given either an exact description of the data distribution or only sample access to it. We give an efficient algorithm that exactly computes the calibration distance when the distribution has a uniform marginal and noiseless labels, which improves the $O(1/\sqrt{|\mathcal{X}|})$ additive approximation of Qiao and Zheng (COLT 2024) for this special case. Perhaps surprisingly, the problem becomes $\mathsf{NP}$-hard when either of the two assumptions is removed. We extend our algorithm to a polynomial-time approximation scheme for the general case. For the estimation problem, we show that $\Theta(1/\epsilon^3)$ samples are sufficient and necessary for the empirical calibration distance to be upper bounded by the true distance plus $\epsilon$. In contrast, a polynomial dependence on the domain size -- incurred by the learning-based baseline -- is unavoidable for two-sided estimation. Our positive results are based on simple sparsifications of both the distribution and the target predictor, which significantly reduce the search space for computation and lead to stronger concentration for the estimation problem. To prove the hardness results, we introduce new techniques for certifying lower bounds on the calibration distance -- a problem that is hard in general due to its $\textsf{co-NP}$-completeness.

顶级标签: theory machine learning model evaluation
详细标签: calibration distance computational complexity statistical estimation np-hardness probabilistic predictors 或 搜索:

校准距离的计算与统计难度 / Computational and Statistical Hardness of Calibration Distance


1️⃣ 一句话总结

这篇论文研究了衡量概率预测器校准误差的核心指标——校准距离的计算和估计问题,发现在理想情况下可以高效计算,但在更一般的设定下计算是NP难的,并给出了高效的近似算法以及估计该距离所需的样本复杂度界限。

源自 arXiv: 2603.18391