ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
August 14, 2025
Authors: Lingen Li, Guangzhi Wang, Zhaoyang Zhang, Yaowei Li, Xiaoyu Li, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
cs.AI
Abstract
Traditional cartoon and anime production involves keyframing, inbetweening,
and colorization stages, which require intensive manual effort. Despite recent
advances in AI, existing methods often handle these stages separately, leading
to error accumulation and artifacts. For instance, inbetweening approaches
struggle with large motions, while colorization methods require dense per-frame
sketches. To address this, we introduce ToonComposer, a generative model that
unifies inbetweening and colorization into a single post-keyframing stage.
ToonComposer employs a sparse sketch injection mechanism to provide precise
control using keyframe sketches. Additionally, a cartoon adaptation method
based on a spatial low-rank adapter tailors a modern video foundation model
to the cartoon domain while keeping its temporal prior intact. Requiring
as few as a single sketch and a colored reference frame, ToonComposer excels
with sparse inputs, while also supporting multiple sketches at any temporal
location for more precise motion control. This dual capability reduces manual
workload and improves flexibility, empowering artists in real-world scenarios.
To evaluate our model, we further created PKBench, a benchmark featuring
human-drawn sketches that simulate real-world use cases. Our evaluation
demonstrates that ToonComposer outperforms existing methods in visual quality,
motion consistency, and production efficiency, offering a superior and more
flexible solution for AI-assisted cartoon production.
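
The abstract names two mechanisms without giving implementation details: sparse sketch injection and a spatial low-rank adapter. The following is a minimal NumPy sketch of one way such ideas could look, not the authors' implementation. Every name here (build_sparse_sketch_condition, SpatialLoRAAttention, the tensor layouts, the zero initialization) is a hypothetical choice made for illustration. The first function places keyframe sketches at arbitrary frame indices and records which frames are conditioned. The class applies a low-rank update to a frozen attention layer that only mixes tokens within a single frame, so nothing in it touches cross-frame (temporal) behavior.

```python
import numpy as np


def build_sparse_sketch_condition(num_frames, height, width, sketches):
    """Hypothetical helper: scatter keyframe sketches into a per-frame tensor.

    sketches maps a frame index to a (height, width) array. Frames without a
    sketch stay zero, and the boolean mask marks which frames are conditioned.
    """
    cond = np.zeros((num_frames, height, width), dtype=np.float32)
    mask = np.zeros(num_frames, dtype=bool)
    for t, sketch in sketches.items():
        if not 0 <= t < num_frames:
            raise IndexError(f"frame index {t} outside [0, {num_frames})")
        cond[t] = sketch
        mask[t] = True
    return cond, mask


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class SpatialLoRAAttention:
    """Assumed design: per-frame self-attention with a low-rank weight update.

    The base projection weights stand in for a frozen pretrained layer. Only
    lora_down and lora_up would be trained during adaptation.
    """

    def __init__(self, dim, rank, seed=0):
        rng = np.random.default_rng(seed)
        # Frozen query/key/value projections, stacked on the first axis.
        self.w_qkv = rng.normal(0.0, dim ** -0.5, size=(3, dim, dim))
        # Low-rank factors; zero-initialized "up" keeps the start point
        # identical to the frozen layer.
        self.lora_down = rng.normal(0.0, 0.02, size=(3, dim, rank))
        self.lora_up = np.zeros((3, rank, dim))

    def __call__(self, x):
        # x has shape (frames, tokens_per_frame, dim).
        w = self.w_qkv + self.lora_down @ self.lora_up  # (3, dim, dim)
        q, k, v = (x @ w[i] for i in range(3))
        # Attention scores are (frames, tokens, tokens): each frame attends
        # only to its own tokens, so no information crosses frames here.
        scores = q @ k.transpose(0, 2, 1) / np.sqrt(x.shape[-1])
        return softmax(scores, axis=-1) @ v
```

For example, build_sparse_sketch_condition(4, 2, 2, {0: first_sketch, 3: last_sketch}) conditions only the first and last of four frames, which mirrors the "multiple sketches at any temporal location" usage described above. Because the adapted layer treats the frame axis as a batch axis, perturbing one frame's tokens leaves the outputs for all other frames unchanged.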