菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-29
📄 Abstract - CurEvo: Curriculum-Guided Self-Evolution for Video Understanding

Recent advances in self-evolution video understanding frameworks have demonstrated the potential of autonomous learning without human annotations. However, existing methods often suffer from weakly controlled optimization and uncontrolled difficulty progression, as they lack structured guidance throughout the iterative learning process. To address these limitations, we propose CurEvo, a curriculum-guided self-evolution framework that introduces curriculum learning into self-evolution to achieve more structured and progressive model improvement. CurEvo dynamically regulates task difficulty, refines evaluation criteria, and balances data diversity according to model competence, forming a curriculum-guided feedback loop that aligns learning complexity with model capability. Built upon this principle, we develop a multi-dimensional adaptive QA framework that jointly evolves question generation and answer evaluation across perception, recognition, and understanding dimensions, ensuring coherent and measurable curriculum progression. Through this integration, CurEvo transforms weakly controlled self-evolution into a more structured learning process for autonomous video understanding. Across seven backbones, CurEvo consistently improves both benchmark accuracy and evaluator-based semantic score on four VideoQA benchmarks, validating the effectiveness of curriculum-guided self-evolution for video understanding.

顶级标签: multi-modal model training video
详细标签: curriculum learning self-evolution video question answering question generation evaluation 或 搜索:

CurEvo:课程引导的自我进化视频理解框架 / CurEvo: Curriculum-Guided Self-Evolution for Video Understanding


1️⃣ 一句话总结

CurEvo通过引入课程学习机制,让视频理解模型在无需人工标注的情况下,根据自身能力动态调整学习任务的难度和多样性,从而像学生上课一样循序渐进地自我提升,显著提高了视频问答的准确性和语义理解能力。

源自 arXiv: 2604.26707