看得更远更广:面向微视频流行度预测的联合时空扩展方法 / Seeing Further and Wider: Joint Spatio-Temporal Enlargement for Micro-Video Popularity Prediction
1️⃣ 一句话总结
本文提出了一种统一框架,通过时间维度的长序列精准感知和空间维度的无限可扩展记忆库,克服了现有微视频流行度预测方法在稀疏采样和有限存储上的局限,显著提升了预测准确性与排序一致性。
Micro-video popularity prediction (MVPP) aims to forecast the future popularity of videos on online media, which is essential for applications such as content recommendation and traffic allocation. In real-world scenarios, it is critical for MVPP approaches to understand both the temporal dynamics of a given video (temporal) and its historical relevance to other videos (spatial). However, existing approaches sufer from limitations in both dimensions: temporally, they rely on sparse short-range sampling that restricts content perception; spatially, they depend on flat retrieval memory with limited capacity and low efficiency, hindering scalable knowledge utilization. To overcome these limitations, we propose a unified framework that achieves joint spatio-temporal enlargement, enabling precise perception of extremely long video sequences while supporting a scalable memory bank that can infinitely expand to incorporate all relevant historical videos. Technically, we employ a Temporal Enlargement driven by a frame scoring module that extracts highlight cues from video frames through two complementary pathways: sparse sampling and dense perception. Their outputs are adaptively fused to enable robust long-sequence content understanding. For Spatial Enlargement, we construct a Topology-Aware Memory Bank that hierarchically clusters historically relevant content based on topological relationships. Instead of directly expanding memory capacity, we update the encoder features of the corresponding clusters when incorporating new videos, enabling unbounded historical association without unbounded storage growth. Extensive experiments on three widely used MVPP benchmarks demonstrate that our method consistently outperforms 11 strong baselines across mainstream metrics, achieving robust improvements in both prediction accuracy and ranking consistency.
看得更远更广:面向微视频流行度预测的联合时空扩展方法 / Seeing Further and Wider: Joint Spatio-Temporal Enlargement for Micro-Video Popularity Prediction
本文提出了一种统一框架,通过时间维度的长序列精准感知和空间维度的无限可扩展记忆库,克服了现有微视频流行度预测方法在稀疏采样和有限存储上的局限,显著提升了预测准确性与排序一致性。
源自 arXiv: 2604.20311