CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation
June 12, 2025
Authors: Zhao Zhang, Yutao Cheng, Dexiang Hong, Maoke Yang, Gonglei Shi, Lei Ma, Hui Zhang, Jie Shao, Xinglong Wu
cs.AI
Abstract
Graphic design plays a crucial role in both commercial and personal contexts,
yet creating high-quality, editable, and aesthetically pleasing graphic
compositions remains a time-consuming and skill-intensive task, especially for
beginners. Current AI tools automate parts of the workflow, but struggle to
accurately incorporate user-supplied assets, maintain editability, and achieve
professional visual appeal. Commercial systems such as Canva Magic Design rely
on vast template libraries, which are impractical to replicate. In this paper,
we introduce CreatiPoster, a framework that generates editable, multi-layer
compositions from optional natural-language instructions or assets. A protocol
model, an RGBA large multimodal model, first produces a JSON specification
detailing every layer (text or asset) with precise layout, hierarchy, content
and style, plus a concise background prompt. A conditional background model
then synthesizes a coherent background conditioned on these rendered foreground
layers. We construct a benchmark with automated metrics for graphic-design
generation and show that CreatiPoster surpasses leading open-source approaches
and proprietary commercial systems. To catalyze further research, we release a
copyright-free corpus of 100,000 multi-layer designs. CreatiPoster supports
diverse applications such as canvas editing, text overlay, responsive resizing,
multilingual adaptation, and animated posters, advancing the democratization of
AI-assisted graphic design. Project homepage:
https://github.com/graphic-design-ai/creatiposter
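
To make the two-stage description concrete, the following is a minimal sketch of what a layer specification of this kind could look like and how it might be validated before rendering. The abstract does not give the actual schema, so every field name here (`kind`, `content`, `box`, `z_index`, `style`, `background_prompt`) and both helper functions are assumptions made for illustration, not CreatiPoster's format or API.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict, List, Tuple

# The two layer types named in the abstract; the string values are assumed.
ALLOWED_KINDS = {"text", "asset"}


@dataclass
class Layer:
    kind: str                        # "text" or "asset"
    content: str                     # text string, or an asset identifier
    box: Tuple[int, int, int, int]   # (x, y, width, height) in pixels (hypothetical)
    z_index: int                     # stacking order; higher values are drawn later
    style: Dict[str, Any] = field(default_factory=dict)


@dataclass
class DesignSpec:
    width: int
    height: int
    background_prompt: str           # short prompt handed to the background stage
    layers: List[Layer]


def parse_spec(raw: str) -> DesignSpec:
    """Parse a JSON string and reject layers that are malformed or off-canvas."""
    data = json.loads(raw)
    width, height = data["width"], data["height"]
    layers = []
    for item in data["layers"]:
        if item["kind"] not in ALLOWED_KINDS:
            raise ValueError(f"unknown layer kind: {item['kind']!r}")
        x, y, w, h = item["box"]
        if x < 0 or y < 0 or w <= 0 or h <= 0 or x + w > width or y + h > height:
            raise ValueError(f"layer box {item['box']} falls outside the canvas")
        layers.append(Layer(item["kind"], item["content"], (x, y, w, h),
                            item["z_index"], item.get("style", {})))
    return DesignSpec(width, height, data["background_prompt"], layers)


def render_order(spec: DesignSpec) -> List[Layer]:
    """Return layers bottom-to-top, the order a renderer would draw them."""
    return sorted(spec.layers, key=lambda layer: layer.z_index)


# A made-up example specification with one asset layer and one text layer.
EXAMPLE = json.dumps({
    "width": 1024,
    "height": 1024,
    "background_prompt": "soft pastel gradient with subtle paper texture",
    "layers": [
        {"kind": "text", "content": "Summer Sale", "box": [112, 80, 800, 160],
         "z_index": 2, "style": {"font_size": 96, "color": "#1A1A1A"}},
        {"kind": "asset", "content": "product.png", "box": [262, 300, 500, 500],
         "z_index": 1},
    ],
})

spec = parse_spec(EXAMPLE)
print([layer.kind for layer in render_order(spec)])  # ['asset', 'text']
```

In a pipeline of the shape the abstract describes, the ordered foreground layers would be rendered first, and the rendered result together with `background_prompt` would then be passed to the background model. Keeping each layer as structured data rather than flattened pixels is what would allow later edits such as moving a text box or swapping an asset.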