基於多智能體思維鏈規劃的自動電影生成
Automated Movie Generation via Multi-Agent CoT Planning
March 10, 2025
作者: Weijia Wu, Zeyu Zhu, Mike Zheng Shou
cs.AI
摘要
現有的長視頻生成框架缺乏自動化規劃,需要手動輸入劇情、場景、攝影和角色互動,導致高成本和低效率。為解決這些挑戰,我們提出了MovieAgent,這是一個通過多代理思維鏈(CoT)規劃實現的自動化電影生成系統。MovieAgent具有兩大優勢:1)我們首次探索並定義了自動化電影/長視頻生成的範式。給定劇本和角色庫,我們的MovieAgent能夠生成多場景、多鏡頭的長視頻,並確保敘事連貫、角色一致、字幕同步以及音頻穩定。2)MovieAgent引入了基於分層CoT的推理過程,自動構建場景、攝影機設置和攝影技術,顯著減少了人力投入。通過使用多個大型語言模型(LLM)代理來模擬導演、編劇、分鏡師和場地經理的職責,MovieAgent簡化了製作流程。實驗表明,MovieAgent在劇本忠實度、角色一致性和敘事連貫性方面達到了新的最先進水平。我們的分層框架邁出了重要一步,為全自動電影生成提供了新的見解。代碼和項目網站可在以下網址獲取:https://github.com/showlab/MovieAgent 和 https://weijiawu.github.io/MovieAgent。
English
Existing long-form video generation frameworks lack automated planning,
requiring manual input for storylines, scenes, cinematography, and character
interactions, resulting in high costs and inefficiencies. To address these
challenges, we present MovieAgent, an automated movie generation via
multi-agent Chain of Thought (CoT) planning. MovieAgent offers two key
advantages: 1) We firstly explore and define the paradigm of automated
movie/long-video generation. Given a script and character bank, our MovieAgent
can generates multi-scene, multi-shot long-form videos with a coherent
narrative, while ensuring character consistency, synchronized subtitles, and
stable audio throughout the film. 2) MovieAgent introduces a hierarchical
CoT-based reasoning process to automatically structure scenes, camera settings,
and cinematography, significantly reducing human effort. By employing multiple
LLM agents to simulate the roles of a director, screenwriter, storyboard
artist, and location manager, MovieAgent streamlines the production pipeline.
Experiments demonstrate that MovieAgent achieves new state-of-the-art results
in script faithfulness, character consistency, and narrative coherence. Our
hierarchical framework takes a step forward and provides new insights into
fully automated movie generation. The code and project website are available
at: https://github.com/showlab/MovieAgent and
https://weijiawu.github.io/MovieAgent.Summary
AI-Generated Summary