菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-05
📄 Abstract - S^2tory: Story Spine Distillation for Movie Script Summarization

Movie scripts pose a fundamental challenge for automatic summarization due to their non-linear, cross-cut narrative structure, which makes surface-level saliency methods ineffective at preserving core story progression. To address this, we introduce S^2tory (Story Spine Distillation), a narratology-grounded framework that leverages character development trajectories to identify plot nuclei, the essential events that drive the narrative forward, while filtering out peripheral satellite events that merely enrich atmosphere or emotion. Our Narrative Expert Agent (NEAgent) performs theory-constrained reasoning, whose distilled knowledge conditions a small model to identify plot nuclei. Another model then uses these plot nuclei to generate the summary. Experiments on the MovieSum dataset demonstrate state-of-the-art semantic fidelity at approximately 3.5x compression, and zero-shot evaluation on BookSum confirms strong out-of-domain generalization. Human evaluation further validates that narratological theory provides an indispensable foundation for modeling complex, non-linear narratives.

顶级标签: natural language processing llm multi-modal
详细标签: movie script summarization narrative structure plot nuclei detection knowledge distillation evaluation 或 搜索:

S²tory:电影剧本摘要生成的故事主线蒸馏技术 / S^2tory: Story Spine Distillation for Movie Script Summarization


1️⃣ 一句话总结

本文提出了一种基于叙事学理论的方法,通过分析角色发展轨迹自动提取电影剧本中的关键情节核心,并过滤掉非核心的修饰性事件,从而生成更忠实于故事主线的简洁摘要,在多个数据集上取得了优于现有模型的效果。

源自 arXiv: 2605.03244