Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning

📄 Abstract - Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning

Soccer commentary plays a crucial role in enhancing the soccer game viewing experience for audiences. Previous studies in automatic soccer commentary generation typically adopt an end-to-end method to generate anonymous live text commentary. Such generated commentary is insufficient in the context of real-world live televised commentary, as it contains anonymous entities, context-dependent errors and lacks statistical insights of the game events. To bridge the gap, we propose GameSight, a two-stage model to address soccer commentary generation as a knowledge-enhanced visual reasoning task, enabling live-televised-like knowledgeable commentary with accurate reference to entities (players and teams). GameSight starts by performing visual reasoning to align anonymous entities with fine-grained visual and contextual analysis. Subsequently, the entity-aligned commentary is refined with knowledge by incorporating external historical statistics and iteratively updated internal game state information. Consequently, GameSight improves the player alignment accuracy by 18.5% on SN-Caption-test-align dataset compared to Gemini 2.5-pro. Combined with further knowledge enhancement, GameSight outperforms in segment-level accuracy and commentary quality, as well as game-level contextual relevance and structural composition. We believe that our work paves the way for a more informative and engaging human-centric experience with the AI sports application. Demo Page: this https URL

迈向知识增强视觉推理的自动足球解说生成 / Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning

1️⃣ 一句话总结

这篇论文提出了一个名为GameSight的两阶段模型，它通过结合视觉分析和外部历史数据，能像真人解说员一样准确提及球员和球队，并融入比赛统计信息，从而生成更专业、更吸引人的自动足球解说。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要