菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-11
📄 Abstract - Evaluation metrics for temporal preservation in synthetic longitudinal patient data

This study introduces a set of metrics for evaluating temporal preservation in synthetic longitudinal patient data, defined as artificially generated data that mimic real patients' repeated measurements over time. The proposed metrics assess how synthetic data reproduces key temporal characteristics, categorized into marginal, covariance, individual-level and measurement structures. We show that strong marginal-level resemblance may conceal distortions in covariance and disruptions in individual-level trajectories. Temporal preservation is influenced by factors such as original data quality, measurement frequency, and preprocessing strategies, including binning, variable encoding and precision. Variables with sparse or highly irregular measurement times provide limited information for learning temporal dependencies, resulting in reduced resemblance between the synthetic and original data. No single metric adequately captures temporal preservation; instead, a multidimensional evaluation across all characteristics provides a more comprehensive assessment of synthetic data quality. Overall, the proposed metrics clarify how and why temporal structures are preserved or degraded, enabling more reliable evaluation and improvement of generative models and supporting the creation of temporally realistic synthetic longitudinal patient data.

顶级标签: medical model evaluation data
详细标签: synthetic data temporal evaluation longitudinal data healthcare generative models 或 搜索:

合成纵向患者数据中时间保真度的评估指标 / Evaluation metrics for temporal preservation in synthetic longitudinal patient data


1️⃣ 一句话总结

这项研究提出了一套新的评估指标,用于衡量合成患者数据在时间维度上的保真度,帮助判断人工生成的数据是否真实地再现了病人随时间变化的关键特征,从而更可靠地评估和改进数据生成模型。

源自 arXiv: 2602.10643