使用混合注意力-卷积框架进行膀胱血管分割 / Bladder Vessel Segmentation using a Hybrid Attention-Convolution Framework
1️⃣ 一句话总结
这篇论文提出了一种结合Transformer和CNN的混合模型,用于从有缺陷的膀胱内窥镜图像中准确分割血管,以解决膀胱癌手术中因器官变形和图像干扰导致的导航难题,并在抑制黏膜褶皱等误报方面表现优异。
Urinary bladder cancer surveillance requires tracking tumor sites across repeated interventions, yet the deformable and hollow bladder lacks stable landmarks for orientation. While blood vessels visible during endoscopy offer a patient-specific "vascular fingerprint" for navigation, automated segmentation is challenged by imperfect endoscopic data, including sparse labels, artifacts like bubbles or variable lighting, continuous deformation, and mucosal folds that mimic vessels. State-of-the-art vessel segmentation methods often fail to address these domain-specific complexities. We introduce a Hybrid Attention-Convolution (HAC) architecture that combines Transformers to capture global vessel topology prior with a CNN that learns a residual refinement map to precisely recover thin-vessel details. To prioritize structural connectivity, the Transformer is trained on optimized ground truth data that exclude short and terminal branches. Furthermore, to address data scarcity, we employ a physics-aware pretraining, that is a self-supervised strategy using clinically grounded augmentations on unlabeled data. Evaluated on the BlaVeS dataset, consisting of endoscopic video frames, our approach achieves high accuracy (0.94) and superior precision (0.61) and clDice (0.66) compared to state-of-the-art medical segmentation models. Crucially, our method successfully suppresses false positives from mucosal folds that dynamically appear and vanish as the bladder fills and empties during surgery. Hence, HAC provides the reliable structural stability required for clinical navigation.
使用混合注意力-卷积框架进行膀胱血管分割 / Bladder Vessel Segmentation using a Hybrid Attention-Convolution Framework
这篇论文提出了一种结合Transformer和CNN的混合模型,用于从有缺陷的膀胱内窥镜图像中准确分割血管,以解决膀胱癌手术中因器官变形和图像干扰导致的导航难题,并在抑制黏膜褶皱等误报方面表现优异。
源自 arXiv: 2602.09949