菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-02
📄 Abstract - End-to-End Shared Attention Estimation via Group Detection with Feedback Refinement

This paper proposes an end-to-end shared attention estimation method via group detection. Most previous methods estimate shared attention (SA) without detecting the actual group of people focusing on it, or assume that there is a single SA point in a given image. These issues limit the applicability of SA detection in practice and impact performance. To address them, we propose to simultaneously achieve group detection and shared attention estimation using a two step process: (i) the generation of SA heatmaps relying on individual gaze attention heatmaps and group membership scalars estimated in a group inference; (ii) a refinement of the initial group memberships allowing to account for the initial SA heatmaps, and the final prediction of the SA heatmap. Experiments demonstrate that our method outperforms other methods in group detection and shared attention estimation. Additional analyses validate the effectiveness of the proposed components. Code: this https URL.

顶级标签: computer vision multi-modal model evaluation
详细标签: shared attention group detection gaze estimation heatmap refinement end-to-end learning 或 搜索:

基于群体检测与反馈优化的端到端共享注意力估计 / End-to-End Shared Attention Estimation via Group Detection with Feedback Refinement


1️⃣ 一句话总结

这篇论文提出了一种新方法,通过同时检测图像中的人群分组并估计他们共同关注的点,解决了以往方法无法处理多组人群或需要预先知道分组的问题,从而更准确地识别出人们在看哪里。

源自 arXiv: 2604.01714