菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - Picid: A Modular Evaluation Infrastructure for Reproducible PHM Across Tasks and Domains

Progress in Prognostics and Health Management (PHM) is hindered by the lack of standardized and reusable evaluation practices across tasks, datasets, and application domains. Reported results are often difficult to reproduce and compare, as key protocol choices, such as data splits, preprocessing, label alignment, temporal windowing, and metrics, are often implicit or implemented ad hoc. We introduce \picid, a modular evaluation infrastructure that formalizes the PHM evaluation pipeline as an explicit, executable, and reproducible protocol. Through well-defined abstractions, \picid enforces deterministic, leakage-safe dataset construction while remaining flexible across diverse PHM settings. The framework supports fault detection, diagnostics, and prognostics through a unified interface and can be extended to new datasets and model classes without violating protocol invariants. By standardizing data contracts and evaluation boundaries, \picid also enables fair cross-task comparisons across diagnostics (classification) and prognostics (regression), allowing identical model families to be evaluated consistently across heterogeneous settings. We demonstrate \picid through an empirical evaluation of thirteen models on twelve datasets spanning batteries, bearings, turbofan engines, hydraulics, filtration systems, and buildings. This work establishes a reusable foundation for standardized, fair and reproducible evaluation in PHM.

顶级标签: systems model evaluation
详细标签: prognostics and health management reproducibility benchmark evaluation infrastructure standardized protocol 或 搜索:

Picid:面向跨任务和跨领域可复现PHM的模块化评估基础设施 / Picid: A Modular Evaluation Infrastructure for Reproducible PHM Across Tasks and Domains


1️⃣ 一句话总结

本文提出了一种名为Picid的模块化工具,通过标准化数据拆分、预处理、标签对齐和评估指标等关键环节,确保在故障诊断、预测等不同PHM任务中,模型评估结果的可复现性和公平可比性,并在十二个数据集上验证了其有效性。

源自 arXiv: 2605.28345