导演之心:通过协作决策实现多模态智能体驱动的电影预可视化 / Mind-of-Director: Multi-modal Agent-Driven Film Previsualization via Collaborative Decision-Making
1️⃣ 一句话总结
这篇论文提出了一个名为‘导演之心’的多智能体协作框架,它能像电影制作团队一样协同工作,将一个创意想法自动转化为包含剧本、场景、角色动作和镜头规划的高质量电影预可视化序列,整个过程大约只需25分钟。
We present Mind-of-Director, a multi-modal agent-driven framework for film previz that models the collaborative decision-making process of a film production team. Given a creative idea, Mind-of-Director orchestrates multiple specialized agents to produce previz sequences within the game engine. The framework consists of four cooperative modules: Script Development, where agents draft and refine the screenplay iteratively; Virtual Scene Design, which transforms text into semantically aligned 3D environments; Character Behaviour Control, which determines character blocking and motion; and Camera Planning, which optimizes framing, movement, and composition for cinematic camera effects. A real-time visual editing system built in the game engine further enables interactive inspection and synchronized timeline adjustment across scenes, behaviours, and cameras. Extensive experiments and human evaluations show that Mind-of-Director generates high-quality, semantically grounded previz sequences in approximately 25 minutes per idea, demonstrating the effectiveness of agent collaboration for both automated prototyping and human-in-the-loop filmmaking.
导演之心:通过协作决策实现多模态智能体驱动的电影预可视化 / Mind-of-Director: Multi-modal Agent-Driven Film Previsualization via Collaborative Decision-Making
这篇论文提出了一个名为‘导演之心’的多智能体协作框架,它能像电影制作团队一样协同工作,将一个创意想法自动转化为包含剧本、场景、角色动作和镜头规划的高质量电影预可视化序列,整个过程大约只需25分钟。
源自 arXiv: 2603.14790