菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-25
📄 Abstract - A multifractal-based masked auto-encoder: an application to medical images

Masked autoencoders (MAE) have shown great promise in medical image classification. However, the random masking strategy employed by traditional MAEs may overlook critical areas in medical images, where even subtle changes can indicate disease. To address this limitation, we propose a novel approach that utilizes a multifractal measure (Renyi entropy) to optimize the masking strategy. Our method, termed Multifractal-Optimized Masked Autoencoder (MO-MAE), employs a multifractal analysis to identify regions of high complexity and information content. By focusing the masking process on these areas, MO-MAE ensures that the model learns to reconstruct the most diagnostically relevant features. This approach is particularly beneficial for medical imaging, where fine-grained inspection of tissue structures is crucial for accurate diagnosis. We evaluate MO-MAE on several medical datasets covering various diseases, including MedMNIST and COVID-CT. Our results demonstrate that MO-MAE achieves promising performance, surpassing other basiline and state-of-the-art models. The proposed method also adds minimum computational overhead as the computation of the proposed measure is straightforward. Our findings suggest that the multifractal-optimized masking strategy enhances the model's ability to capture and reconstruct complex tissue structures, leading to more accurate and efficient medical image representation. The proposed MO-MAE framework offers a promising direction for improving the accuracy and efficiency of deep learning models in medical image analysis, potentially advancing the field of computer-aided diagnosis.

顶级标签: medical computer vision model training
详细标签: masked autoencoder multifractal analysis medical image classification masking strategy self-supervised learning 或 搜索:

基于多重分形的掩码自编码器:在医学图像中的应用 / A multifractal-based masked auto-encoder: an application to medical images


1️⃣ 一句话总结

本文提出一种名为MO-MAE的新方法,通过使用多重分形分析(基于Renyi熵)来优化掩码自编码器的随机掩码策略,使其更关注医学图像中信息丰富且与诊断相关的复杂区域,从而在保持低计算开销的同时显著提升医学图像分类和特征重建的性能。

源自 arXiv: 2605.26287