扩散积分梯度:面向灵活特征归因的可控路径生成 / Diffusion Integrated Gradients: Controllable Path Generation for Flexible Feature Attribution
1️⃣ 一句话总结
本文提出了一种名为扩散积分梯度(DiffIG)的新方法,利用扩散模型自动生成最优归因路径,替代传统固定路径,从而更灵活、更清晰地解释AI模型的预测依据。
Path-based attribution methods such as Integrated Gradients (IG) are widely adopted for their strong axiomatic properties and effectiveness in attributing model predictions to input features by integrating gradients along a path from a baseline to the input. However, the choice of the attribution path largely affects the quality of explanations, and existing approaches rely on fixed or hand-crafted paths that often produce noisy or distorted attributions. To address this limitation, we propose Diffusion Integrated Gradients (DiffIG), a novel method that reformulates path generation as a conditional generative modeling problem. DiffIG first trains a diffusion model to learn a distribution over paths generated from a Stick-Breaking Process, then employs guided sampling to embed user guidance during the sampling procedure. We demonstrate that DiffIG quantitatively matches or outperforms existing path-based methods, achieving perceptually aligned explanations. This work introduces a new generative perspective for flexible, inference-time controllable Explainable Artificial Intelligence (XAI) methods.
扩散积分梯度:面向灵活特征归因的可控路径生成 / Diffusion Integrated Gradients: Controllable Path Generation for Flexible Feature Attribution
本文提出了一种名为扩散积分梯度(DiffIG)的新方法,利用扩散模型自动生成最优归因路径,替代传统固定路径,从而更灵活、更清晰地解释AI模型的预测依据。
源自 arXiv: 2606.22314