分离-效用帕累托前沿:一种信息论视角下的特征刻画 / Separation-Utility Pareto Frontier: An Information-Theoretic Characterization
1️⃣ 一句话总结
这篇论文从信息论的角度,首次系统刻画了机器学习模型在预测准确性和公平性之间的最优权衡边界,并提出了一种易于实现的正则化方法,能在保证模型性能的同时有效减少对敏感属性的依赖。
We study the Pareto frontier (optimal trade-off) between utility and separation, a fairness criterion requiring predictive independence from sensitive attributes conditional on the true outcome. Through an information-theoretic lens, we prove a characterization of the utility-separation Pareto frontier, establish its concavity, and thereby prove the increasing marginal cost of separation in terms of utility. In addition, we characterize the conditions under which this trade-off becomes strict, providing a guide for trade-off selection in practice. Based on the theoretical characterization, we develop an empirical regularizer based on conditional mutual information (CMI) between predictions and sensitive attributes given the true outcome. The CMI regularizer is compatible with any deep model trained via gradient-based optimization and serves as a scalar monitor of residual separation violations, offering tractable guarantees during training. Finally, numerical experiments support our theoretical findings: across COMPAS, UCI Adult, UCI Bank, and CelebA, the proposed method substantially reduces separation violations while matching or exceeding the utility of established baseline methods. This study thus offers a provable, stable, and flexible approach to enforcing separation in deep learning.
分离-效用帕累托前沿:一种信息论视角下的特征刻画 / Separation-Utility Pareto Frontier: An Information-Theoretic Characterization
这篇论文从信息论的角度,首次系统刻画了机器学习模型在预测准确性和公平性之间的最优权衡边界,并提出了一种易于实现的正则化方法,能在保证模型性能的同时有效减少对敏感属性的依赖。
源自 arXiv: 2602.04408