菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-02
📄 Abstract - Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification

While deep learning models like Vision Transformer (ViT) have achieved significant advances, they typically require large datasets. With data privacy regulations, access to many original datasets is restricted, especially medical images. Federated learning (FL) addresses this challenge by enabling global model aggregation without data exchange. However, the heterogeneity of the data and the class imbalance that exist in local clients pose challenges for the generalization of the model. This study proposes a FL framework leveraging a dynamic adaptive focal loss (DAFL) and a client-aware aggregation strategy for local training. Specifically, we design a dynamic class imbalance coefficient that adjusts based on each client's sample distribution and class data distribution, ensuring minority classes receive sufficient attention and preventing sparse data from being ignored. To address client heterogeneity, a weighted aggregation strategy is adopted, which adapts to data size and characteristics to better capture inter-client variations. The classification results on three public datasets (ISIC, Ocular Disease and RSNA-ICH) show that the proposed framework outperforms DenseNet121, ResNet50, ViT-S/16, ViT-L/32, FedCLIP, Swin Transformer, CoAtNet, and MixNet in most cases, with accuracy improvements ranging from 0.98\% to 41.69\%. Ablation studies on the imbalanced ISIC dataset validate the effectiveness of the proposed loss function and aggregation strategy compared to traditional loss functions and other FL approaches. The codes can be found at: this https URL.

顶级标签: medical model training machine learning
详细标签: federated learning vision transformer class imbalance medical image classification adaptive focal loss 或 搜索:

基于自适应焦点损失的联邦视觉Transformer用于医学图像分类 / Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification


1️⃣ 一句话总结

这项研究提出了一种新的联邦学习方法,通过结合动态自适应焦点损失和客户端感知的聚合策略,有效解决了医学图像数据在隐私保护下的类别不平衡和客户端数据差异问题,从而提升了分类模型的性能。

源自 arXiv: 2602.01633