GRAFT:通过全局特征归因审计图神经网络 / GRAFT: Auditing Graph Neural Networks via Global Feature Attribution
1️⃣ 一句话总结
本文提出了一种名为GRAFT的新方法,能够从全局角度解释图神经网络(GNN)在做节点分类时依赖哪些输入特征,并通过自动生成通俗易懂的文字规则,帮助用户理解模型决策、发现潜在偏见或提升模型迁移效果。
Graph Neural Networks (GNNs) achieve strong performance on node classification tasks but remain difficult to interpret, particularly with respect to which input features drive their predictions. Existing global GNN explainers operate at the structural level identifying recurring subgraph motifs, but none explain model behaviour globally at the level of input node attributes. We propose GRAFT, a posthoc global explanation framework that identifies class-level feature importance profiles for GNNs. The method combines diversity-guided exemplar selection, Integrated Gradients-based attribution, and aggregation to construct a global view of feature influence for each class, which can be further expressed as concise natural language rules using a large language model with self-refinement. We evaluate GRAFT across multiple datasets, architectures, and experimental settings, demonstrating its effectiveness in capturing model-relevant features, supporting bias analysis, and enabling feature-efficient transfer learning. In addition, we introduce a structured human evaluation protocol to assess the interpretability of generated rules along dimensions such as accuracy and usefulness. Our results suggest that GRAFT provides a practical and interpretable approach for analysing feature-level behaviour in GNNs, bridging quantitative attribution with human-understandable explanations.
GRAFT:通过全局特征归因审计图神经网络 / GRAFT: Auditing Graph Neural Networks via Global Feature Attribution
本文提出了一种名为GRAFT的新方法,能够从全局角度解释图神经网络(GNN)在做节点分类时依赖哪些输入特征,并通过自动生成通俗易懂的文字规则,帮助用户理解模型决策、发现潜在偏见或提升模型迁移效果。
源自 arXiv: 2605.03377