HMAR:用于医学图像检索的层级化模态感知专家与动态路由架构 / HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture
1️⃣ 一句话总结
这篇论文提出了一个名为HMAR的智能医学图像检索新框架,它通过两个分工不同的专家模块,既能匹配整张图像的全局特征,又能精确查找病灶区域,从而在无需昂贵标注的情况下,实现了更精准、更符合临床需求的图像检索。
Medical image retrieval (MIR) is a critical component of computer-aided diagnosis, yet existing systems suffer from three persistent limitations: uniform feature encoding that fails to account for the varying clinical importance of anatomical structures, ambiguous similarity metrics based on coarse classification labels, and an exclusive focus on global image similarity that cannot meet the clinical demand for fine-grained region-specific retrieval. We propose HMAR (Hierarchical Modality-Aware Expert and Dynamic Routing), an adaptive retrieval framework built on a Mixture-of-Experts (MoE) architecture. HMAR employs a dual-expert mechanism: Expert0 extracts global features for holistic similarity matching, while Expert1 learns position-invariant local representations for precise lesion-region retrieval. A two-stage contrastive learning strategy eliminates the need for expensive bounding-box annotations, and a sliding-window matching algorithm enables dense local comparison at inference time. Hash codes are generated via Kolmogorov-Arnold Network (KAN) layers for efficient Hamming-distance search. Experiments on the RadioImageNet-CT dataset (16 clinical patterns, 29,903 images) show that HMAR achieves mean Average Precision (mAP) of 0.711 and 0.724 for 64-bit and 128-bit hash codes, improving over the state-of-the-art ACIR method by 0.7% and 1.1%, respectively.
HMAR:用于医学图像检索的层级化模态感知专家与动态路由架构 / HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture
这篇论文提出了一个名为HMAR的智能医学图像检索新框架,它通过两个分工不同的专家模块,既能匹配整张图像的全局特征,又能精确查找病灶区域,从而在无需昂贵标注的情况下,实现了更精准、更符合临床需求的图像检索。
源自 arXiv: 2603.16679