菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-29
📄 Abstract - Looking Beyond Accuracy: A Holistic Benchmark of ECG Foundation Models

The electrocardiogram (ECG) is a cost-effective, highly accessible and widely employed diagnostic tool. With the advent of Foundation Models (FMs), the field of AI-assisted ECG interpretation has begun to evolve, as they enable model reuse across different tasks by relying on embeddings. However, to responsibly employ FMs, it is crucial to rigorously assess to which extent the embeddings they produce are generalizable, particularly in error-sensitive domains such as healthcare. Although prior works have already addressed the problem of benchmarking ECG-expert FMs, they focus predominantly on the evaluation of downstream performance. To fill this gap, this study aims to find an in-depth, comprehensive benchmarking framework for FMs, with a specific focus on ECG-expert ones. To this aim, we introduce a benchmark methodology that complements performance-based evaluation with representation-level analysis, leveraging SHAP and UMAP techniques. Furthermore, we rely on the methodology for carrying out an extensive evaluation of several ECG-expert FMs pretrained via state-of-the-art techniques over different cross-continental datasets and data availability settings; this includes ones featuring data scarcity, a fairly common situation in real-world medical scenarios. Experimental results show that our benchmarking protocol provides a rich insight of ECG-expert FMs' embedded patterns, enabling a deeper understanding of their representational structure and generalizability.

顶级标签: medical model evaluation benchmark
详细标签: ecg analysis foundation models representation analysis healthcare ai model generalization 或 搜索:

超越准确率:心电图基础模型的综合性基准评估 / Looking Beyond Accuracy: A Holistic Benchmark of ECG Foundation Models


1️⃣ 一句话总结

这篇论文提出了一个超越传统性能评估的综合性基准框架,通过结合特征表示分析来深入评估心电图基础模型的泛化能力和内在模式,为医疗AI的可靠应用提供更全面的洞见。

源自 arXiv: 2601.21830