菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-16
📄 Abstract - Predicting Post-Traumatic Epilepsy from Clinical Records using Large Language Model Embeddings

Objective: Post-traumatic epilepsy (PTE) is a debilitating neurological disorder that develops after traumatic brain injury (TBI). Early prediction of PTE remains challenging due to heterogeneous clinical data, limited positive cases, and reliance on resource-intensive neuroimaging data. We investigate whether routinely collected acute clinical records alone can support early PTE prediction using language model-based approaches. Methods: Using a curated subset of the TRACK-TBI cohort, we developed an automated PTE prediction framework that implements pretrained large language models (LLMs) as fixed feature extractors to encode clinical records. Tabular features, LLM-generated embeddings, and hybrid feature representations were evaluated using gradient-boosted tree classifiers under stratified cross-validation. Results: LLM embeddings achieved performance improvements by capturing contextual clinical information compared to using tabular features alone. The best performance was achieved by a modality-aware feature fusion strategy combining tabular features and LLM embeddings, achieving an AUC-ROC of 0.892 and AUPRC of 0.798. Acute post-traumatic seizures, injury severity, neurosurgical intervention, and ICU stay are key contributors to the predictive performance. Significance: These findings demonstrate that routine acute clinical records contain information suitable for early PTE risk prediction using LLM embeddings in conjunction with gradient-boosted tree classifiers. This approach represents a promising complement to imaging-based prediction.

顶级标签: llm medical natural language processing
详细标签: clinical prediction feature extraction gradient boosting traumatic brain injury risk stratification 或 搜索:

利用大语言模型嵌入从临床记录预测创伤后癫痫 / Predicting Post-Traumatic Epilepsy from Clinical Records using Large Language Model Embeddings


1️⃣ 一句话总结

这项研究提出了一种新方法,通过大语言模型分析常规的急性期临床记录,结合梯度提升树模型,能够有效预测脑外伤后发生癫痫的风险,为早期预警提供了不依赖昂贵影像检查的补充手段。

源自 arXiv: 2604.14547