菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-21
📄 Abstract - RF-HiT: Rectified Flow Hierarchical Transformer for General Medical Image Segmentation

Accurate medical image segmentation requires both long-range contextual reasoning and precise boundary delineation, a task where existing transformer- and diffusion-based paradigms are frequently bottlenecked by quadratic computational complexity and prohibitive inference latency. We propose RF-HiT, a Rectified Flow Hierarchical Transformer that integrates an hourglass transformer backbone with a multi-scale hierarchical encoder for anatomically guided feature conditioning. Unlike prior diffusion-based approaches, RF-HiT leverages rectified flow with efficient transformer blocks to achieve linear complexity while requiring only a few discretization steps. The model further fuses conditioning features across resolutions via learnable interpolation, enabling effective multi-scale representation with minimal computational overhead. As a result, RF-HiT achieves a strong efficiency-performance trade-off, requiring only 10.14 GFLOPs, 13.6M parameters, and inference in as few as three steps. Despite its compact design, RF-HiT attains 91.27% mean Dice on ACDC and 87.40% on BraTS 2021, achieving performance comparable to or exceeding that of significantly more intensive architectures. This demonstrates its strong potential as a robust, computationally efficient foundation for real-time clinical segmentation.

顶级标签: medical computer vision model training
详细标签: image segmentation rectified flow transformer efficient inference multi-scale 或 搜索:

RF-HiT:用于通用医学图像分割的修正流层次化Transformer / RF-HiT: Rectified Flow Hierarchical Transformer for General Medical Image Segmentation


1️⃣ 一句话总结

本文提出了一种名为RF-HiT的高效医学图像分割模型,通过结合层次化Transformer和修正流技术,在保持低计算量和极少推理步数的同时,实现了与大型复杂模型相当的高精度分割性能,适合实时临床应用。

源自 arXiv: 2604.19570