菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-26
📄 Abstract - Data-driven sparse identification of governing PDEs via knockoff filters and multi-criteria trade-offs

We propose KO-PDE-IDENT, a data-driven framework for identifying parsimonious partial differential equations (PDEs) with false discovery rate (FDR) control. PDE discovery from noisy observations is often hindered by extreme multicollinearity among candidate terms, which causes typical sparse-regression methods to select spurious terms. To address this problem, KO-PDE-IDENT initially mines a support set of potential candidate terms via model-X knockoff filters with finite-sample FDR control, then refines and ranks the surviving PDE alternatives. The framework integrates three components. First, knockoff feature statistics are constructed by coupling $\ell_{0}$-constrained adaptive best-subset selection with SHapley Additive exPlanations (SHAP), yielding an effective and computationally efficient difference statistic. Second, a recursive feature elimination (RFE) procedure removes terms whose marginal contributions are dispensable and assesses statistical necessity through knockoff-perturbed hypothesis testing. Third, the final model selection is formulated as a multi-criteria decision-making (MCDM) problem, where the optimal governing equation is the alternative that best balances a wide range of criteria such as predictive accuracy, model complexity and coefficient uncertainty. We validate KO-PDE-IDENT on five canonical PDEs under severe noise corruption. Empirical results show that our framework can exactly recover the true PDE structure, eliminating false discoveries while retaining all true underlying terms, with low coefficient estimation error.

顶级标签: machine learning systems model evaluation
详细标签: pde discovery knockoff filters false discovery rate sparse regression multi-criteria decision making 或 搜索:

基于敲除过滤与多准则权衡的数据驱动稀疏偏微分方程识别 / Data-driven sparse identification of governing PDEs via knockoff filters and multi-criteria trade-offs


1️⃣ 一句话总结

该论文提出了一种名为KO-PDE-IDENT的框架,能够从含有噪音的观测数据中自动识别出简洁的偏微分方程,并严格控制误报术语,通过组合特征选择、统计检验和多准则决策来获得既准确又简洁的方程。

源自 arXiv: 2605.26631