菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-15
📄 Abstract - Artificial intelligence application in lymphoma diagnosis with Vision Transformer using weakly supervised training

Vision transformers (ViT) have been shown to allow for more flexible feature detection and can outperform convolutional neural network (CNN) when pre-trained on sufficient data. Due to their promising feature detection capabilities, we deployed ViTs for morphological classification of anaplastic large cell lymphoma (ALCL) versus classic Hodgkin lymphoma (cHL). We had previously designed a ViT model which was trained on a small dataset of 1,200 image patches in fully supervised training. That model achieved a diagnostic accuracy of 100% and an F1 score of 1.0 on the independent test set. Since fully supervised training is not a practical method due to lack of expertise resources in both the training and testing phases, we conducted a recent study on a modified approach to training data (weakly supervised training) and show that labeling training image patch automatically at the slide level of each whole-slide-image is a more practical solution for clinical use of Vision Transformer. Our ViT model, trained on a larger dataset of 100,000 image patches, yields evaluation metrics with significant accuracy, F1 score, and area under the curve (AUC) at 91.85%, 0.92, and 0.98, respectively. These are respectable values that qualify this ViT model, with weakly supervised training, as a suitable tool for a deep learning module in clinical model development using automated image patch extraction.

顶级标签: medical computer vision model training
详细标签: vision transformer weakly supervised learning lymphoma diagnosis pathology whole-slide image 或 搜索:

基于弱监督训练与Vision Transformer的人工智能在淋巴瘤诊断中的应用 / Artificial intelligence application in lymphoma diagnosis with Vision Transformer using weakly supervised training


1️⃣ 一句话总结

本研究提出了一种使用弱监督训练方法训练的Vision Transformer模型,能够以高准确度自动区分两种淋巴瘤亚型,为临床病理诊断提供了一种更实用的人工智能辅助工具。

源自 arXiv: 2604.13795