菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-07-02
📄 Abstract - Algebraic Model Counting for Global Analysis of Optimal Decision Trees

Ensuring model reliability in Explainable AI requires a global assessment of the hypothesis space. We propose a formal framework for the exhaustive analysis of optimal and near-optimal decision trees, called Algebraic Decision Tree Counting (ADTC). Inspired by Algebraic Model Counting (AMC) in knowledge representation, ADTC reformulates diverse analytical tasks, such as optimization, counting, and sampling, into a unified sum-of-products computation over a semiring $R$. While the hypothesis space of decision trees is doubly exponential with respect to the maximum depth $\Delta$, our dynamic programming algorithm achieves $O^*(n^{O(\Delta)})$ time complexity in the number of features $n$, where $O^*$ suppresses polynomial factors. To handle complex constraints consisting of multiple tree metrics, we introduce model behavior tensors that aggregate semiring values via convolution products over a tensor semiring. This algebraic approach efficiently constructs a model profile that captures the global landscape and trade-offs between criteria such as accuracy, size, and fairness. We demonstrate the utility of our software, emtrees, on real-world datasets, illustrating how ADTC facilitates evidence-based model selection in sensitive domains.

顶级标签: machine learning model evaluation explainable ai
详细标签: decision trees algebraic model counting dynamic programming model selection fairness 或 搜索:

用于最优决策树全局分析的代数模型计数 / Algebraic Model Counting for Global Analysis of Optimal Decision Trees


1️⃣ 一句话总结

本文提出了一种名为代数决策树计数(ADTC)的新方法,通过将多种分析任务(如优化、计数和采样)统一为半环上的求和运算,并采用动态规划算法,从而能够高效地对最优及近似最优决策树的整个假设空间进行全局分析,帮助用户在准确率、规模和公平性等指标之间做出基于证据的模型选择。

源自 arXiv: 2607.02069