📄
Abstract - Validation of an AI-based end-to-end model for prostate pathology using long-term archived routine samples
Artificial intelligence (AI) is becoming a clinical tool for prostate pathology, but generalization across variations in sample preparation and preservation over prolonged time periods remains poorly understood. We evaluated GleasonAI, an end-to-end attention-based multiple instance learning model, on an independent validation cohort comprising 10,366 biopsy cores from 1,028 patients across 14 Swedish regions, using archival diagnostic specimens from the ProMort cohorts collected between 1998-2015. The model achieved an overall quadratic-weighted kappa of 0.86 for core-level ISUP grading, comparable to several experienced pathologists and consistent across geographic regions. Notably, performance remained stable across the 17-year collection period, demonstrating robustness to time-related variation in archival material, a property not consistently observed with foundation model-based approaches, with exploratory analysis demonstrating a significant prognostic gradient across AI-assigned grade groups for prostate cancer-specific mortality. These findings support the generalizability of the AI grading model and demonstrate the potential of pathology archives as a large-scale resource for AI development, validation, and retrospective prognostic research.
基于人工智能的前列腺病理端到端模型在长期存档常规样本中的验证 /
Validation of an AI-based end-to-end model for prostate pathology using long-term archived routine samples
1️⃣ 一句话总结
本研究验证了一个名为GleasonAI的人工智能模型,它能够像经验丰富的病理学家一样准确评估前列腺癌的严重程度,并且对来自不同地区、跨越17年之久的存档样本都表现稳定,证明了该模型在实际临床和历史数据中的可靠性和广泛适用性。