面向动作识别的部分骨架可见性:一种受限视野方法 / Partial Skeleton Visibility for Action Recognition: A Constrained Field-of-View Approach
1️⃣ 一句话总结
本文提出了一种名为PartialVisGraph的超图框架,通过引入可学习的虚拟超边和可见性先验机制,有效解决了在视野受限场景下骨架关键点部分缺失对动作识别性能的严重影响,在模拟视野遮挡的数据集上取得了显著优于现有方法的效果。
Skeleton-based action recognition has achieved remarkable success by exploiting joint coordinates and their topological connections, yet prevailing methods overwhelmingly assume complete and clean skeleton inputs. In real-world deployments, such as egocentric vision, crowded surveillance, wearable devices, or edge robotics, limited field-of-view (FoV) frequently causes substantial joint visibility dropout, leading to severe performance degradation that existing models are largely unprepared to handle. To bridge this critical yet underexplored gap, we introduce PartialVisGraph, a novel hypergraph framework tailored for robust skeleton action recognition under constrained FoV. We first construct highly expressive hypergraphs by introducing learnable virtual hyperedges that form a soft incidence matrix, capturing flexible high-order dependencies beyond conventional pairwise graphs. We then propose the Single-Head Sample-Adaptive Transformer, which adaptively aggregates joint features onto hyperedges while explicitly incorporating a visibility prior. This prior selectively gates information flow, preventing occluded or out-of-view joints from corrupting reliable feature propagation. We further establish rigorous evaluation protocols with realistic FoV simulation benchmarks on NTU RGB+D 60 and 120. Extensive experiments demonstrate that PartialVisGraph consistently achieves state-of-the-art accuracy under partial visibility, with gains of up to 68.8\% on subsets with severe FoV restrictions compared to recent strong baselines, while remaining superior on full-visibility settings. Our approach offers a principled and practical pathway toward deployable skeleton-based action understanding in unconstrained environments.
面向动作识别的部分骨架可见性:一种受限视野方法 / Partial Skeleton Visibility for Action Recognition: A Constrained Field-of-View Approach
本文提出了一种名为PartialVisGraph的超图框架,通过引入可学习的虚拟超边和可见性先验机制,有效解决了在视野受限场景下骨架关键点部分缺失对动作识别性能的严重影响,在模拟视野遮挡的数据集上取得了显著优于现有方法的效果。
源自 arXiv: 2607.00716