菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-23
📄 Abstract - A Statistical Approach for Modeling Irregular Multivariate Time Series with Missing Observations

Irregular multivariate time series with missing values present significant challenges for predictive modeling in domains such as healthcare. While deep learning approaches often focus on temporal interpolation or complex architectures to handle irregularities, we propose a simpler yet effective alternative: extracting time-agnostic summary statistics to eliminate the temporal axis. Our method computes four key features per variable-mean and standard deviation of observed values, as well as the mean and variability of changes between consecutive observations to create a fixed-dimensional representation. These features are then utilized with standard classifiers, such as logistic regression and XGBoost. Evaluated on four biomedical datasets (PhysioNet Challenge 2012, 2019, PAMAP2, and MIMIC-III), our approach achieves state-of-the-art performance, surpassing recent transformer and graph-based models by 0.5-1.7% in AUROC/AUPRC and 1.1-1.7% in accuracy/F1-score, while reducing computational complexity. Ablation studies demonstrate that feature extraction-not classifier choice-drives performance gains, and our summary statistics outperform raw/imputed input in most benchmarks. In particular, we identify scenarios where missing patterns themselves encode predictive signals, as in sepsis prediction (PhysioNet, 2019), where missing indicators alone can achieve 94.2% AUROC with XGBoost, only 1.6% lower than using original raw data as input. Our results challenge the necessity of complex temporal modeling when task objectives permit time-agnostic representations, providing an efficient and interpretable solution for irregular time series classification.

顶级标签: medical machine learning model evaluation
详细标签: multivariate time series missing data feature extraction biomedical classification interpretable models 或 搜索:

一种针对含缺失观测值的不规则多元时间序列的统计建模方法 / A Statistical Approach for Modeling Irregular Multivariate Time Series with Missing Observations


1️⃣ 一句话总结

这篇论文提出了一种简单有效的方法,通过提取与时间无关的统计特征(如均值和变化率)来替代复杂的时序建模,从而在多个医疗数据集上以更低的计算成本实现了优于前沿深度学习模型的分类性能。

源自 arXiv: 2602.19531