混合比例估计与条件独立的弱监督核检验 / Mixture Proportion Estimation and Weakly-supervised Kernel Test for Conditional Independence
1️⃣ 一句话总结
这篇论文提出了一种新的方法,通过假设类别标签条件下的独立性,来更准确地从未标记数据中估计各类别的比例,并开发了相应的统计检验来验证这个假设,从而改进了弱监督学习中的关键问题。
Mixture proportion estimation (MPE) aims to estimate class priors from unlabeled data. This task is a critical component in weakly supervised learning, such as PU learning, learning with label noise, and domain adaptation. Existing MPE methods rely on the \textit{irreducibility} assumption or its variant for identifiability. In this paper, we propose novel assumptions based on conditional independence (CI) given the class label, which ensure identifiability even when irreducibility does not hold. We develop method of moments estimators under these assumptions and analyze their asymptotic properties. Furthermore, we present weakly-supervised kernel tests to validate the CI assumptions, which are of independent interest in applications such as causal discovery and fairness evaluation. Empirically, we demonstrate the improved performance of our estimators compared with existing methods and that our tests successfully control both type I and type II errors.\label{key}
混合比例估计与条件独立的弱监督核检验 / Mixture Proportion Estimation and Weakly-supervised Kernel Test for Conditional Independence
这篇论文提出了一种新的方法,通过假设类别标签条件下的独立性,来更准确地从未标记数据中估计各类别的比例,并开发了相应的统计检验来验证这个假设,从而改进了弱监督学习中的关键问题。
源自 arXiv: 2604.07191