菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-16
📄 Abstract - Unsupervised Skeleton-Based Action Segmentation via Hierarchical Spatiotemporal Vector Quantization

We propose a novel hierarchical spatiotemporal vector quantization framework for unsupervised skeleton-based temporal action segmentation. We first introduce a hierarchical approach, which includes two consecutive levels of vector quantization. Specifically, the lower level associates skeletons with fine-grained subactions, while the higher level further aggregates subactions into action-level representations. Our hierarchical approach outperforms the non-hierarchical baseline, while primarily exploiting spatial cues by reconstructing input skeletons. Next, we extend our approach by leveraging both spatial and temporal information, yielding a hierarchical spatiotemporal vector quantization scheme. In particular, our hierarchical spatiotemporal approach performs multi-level clustering, while simultaneously recovering input skeletons and their corresponding timestamps. Lastly, extensive experiments on multiple benchmarks, including HuGaDB, LARa, and BABEL, demonstrate that our approach establishes a new state-of-the-art performance and reduces segment length bias in unsupervised skeleton-based temporal action segmentation.

顶级标签: computer vision machine learning model training
详细标签: action segmentation skeleton-based unsupervised learning vector quantization spatiotemporal modeling 或 搜索:

基于分层时空向量量化的无监督骨架动作分割 / Unsupervised Skeleton-Based Action Segmentation via Hierarchical Spatiotemporal Vector Quantization


1️⃣ 一句话总结

这篇论文提出了一种新的分层时空向量量化方法,无需人工标注就能将连续的骨架动作视频自动分割成有意义的动作片段,并在多个公开数据集上取得了当前最好的效果。

源自 arXiv: 2604.15196