基于代理模型的Shapley和Banzhaf交互作用近似方法 / Proxy-Based Approximation of Shapley and Banzhaf Interactions
1️⃣ 一句话总结
本文提出了一种名为ProxySHAP的新方法,通过结合树模型的高效性与残差校正技术,在兼顾速度和精度的前提下,首次实现了对复杂机器学习模型中高阶特征交互作用的快速准确估计,大幅优于现有方法。
Shapley and Banzhaf interactions capture the complex dynamics inherent in modern machine learning applications. However, current estimators for these higher-order interactions trade off between speed and accuracy. To overcome this limitation, we introduce ProxySHAP. ProxySHAP reconciles the high sample efficiency of tree-based proxy models with a principled path to consistency via residual correction. On a theoretical level, we derive a polynomial-time generalization of interventional TreeSHAP to compute exact interaction indices for tree ensembles, successfully bypassing exponential tree-depth dependencies in prior methods. Furthermore, we formally analyze the residual adjustment strategy, characterizing the specific conditions under which Maximum Sample Reuse (MSR) corrects proxy bias without its variance scaling exponentially with interaction size. Extensive benchmarking demonstrates that ProxySHAP sets a new state-of-the-art standard for approximation quality, including in large-scale applications with thousands of features. By achieving the lowest error in both small- and large-budget regimes, ProxySHAP significantly outperforms the prior best estimators ProxySPEX and KernelSHAP-IQ, while also delivering superior performance on downstream explainability tasks.
基于代理模型的Shapley和Banzhaf交互作用近似方法 / Proxy-Based Approximation of Shapley and Banzhaf Interactions
本文提出了一种名为ProxySHAP的新方法,通过结合树模型的高效性与残差校正技术,在兼顾速度和精度的前提下,首次实现了对复杂机器学习模型中高阶特征交互作用的快速准确估计,大幅优于现有方法。
源自 arXiv: 2605.22738