菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-24
📄 Abstract - SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement

Accurate and stable field-of-view (FoV) guidance is critical for safe and efficient minimally invasive surgery, yet existing approaches often conflate visual attention estimation with downstream camera control or rely on direct object-centric assumptions. In this work, we formulate surgical attention tracking as a spatio-temporal learning problem and model surgeon focus as a dense attention heatmap, enabling continuous and interpretable frame-wise FoV guidance. We propose SurgAtt-Tracker, a holistic framework that robustly tracks surgical attention by exploiting temporal coherence through proposal-level reranking and motion-aware refinement, rather than direct regression. To support systematic training and evaluation, we introduce SurgAtt-1.16M, a large-scale benchmark with a clinically grounded annotation protocol that enables comprehensive heatmap-based attention analysis across procedures and institutions. Extensive experiments on multiple surgical datasets demonstrate that SurgAtt-Tracker consistently achieves state-of-the-art performance and strong robustness under occlusion, multi-instrument interference, and cross-domain settings. Beyond attention tracking, our approach provides a frame-wise FoV guidance signal that can directly support downstream robotic FoV planning and automatic camera control.

顶级标签: computer vision medical systems
详细标签: surgical attention tracking temporal modeling heatmap prediction robotic guidance benchmark 或 搜索:

SurgAtt-Tracker:通过时序提议重排序与运动感知精细化实现在线手术注意力追踪 / SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement


1️⃣ 一句话总结

这篇论文提出了一种名为SurgAtt-Tracker的新方法,通过分析视频序列中外科医生注意力的时空变化来实时追踪手术视野焦点,并建立了一个大规模数据集用于训练和评估,该方法在多种干扰下表现稳健,能为手术机器人的自动视野规划提供直接指导。

源自 arXiv: 2602.20636