WorldGen: From Text to Traversable and Interactive 3D Worlds

November 20, 2025
Authors: Dilin Wang, Hyunyoung Jung, Tom Monnier, Kihyuk Sohn, Chuhang Zou, Xiaoyu Xiang, Yu-Ying Yeh, Di Liu, Zixuan Huang, Thu Nguyen-Phuoc, Yuchen Fan, Sergiu Oprea, Ziyan Wang, Roman Shapovalov, Nikolaos Sarafianos, Thibault Groueix, Antoine Toisoul, Prithviraj Dhar, Xiao Chu, Minghao Chen, Geon Yeong Park, Mahima Gupta, Yassir Azziz, Rakesh Ranjan, Andrea Vedaldi
cs.AI

Abstract

We introduce WorldGen, a system that enables the automatic creation of large-scale, interactive 3D worlds directly from text prompts. Our approach transforms natural language descriptions into traversable, fully textured environments that can be immediately explored or edited within standard game engines. By combining LLM-driven scene layout reasoning, procedural generation, diffusion-based 3D generation, and object-aware scene decomposition, WorldGen bridges the gap between creative intent and functional virtual spaces, allowing creators to design coherent, navigable worlds without manual modeling or specialized 3D expertise. The system is fully modular and supports fine-grained control over layout, scale, and style, producing worlds that are geometrically consistent, visually rich, and efficient to render in real time. This work represents a step towards accessible, generative world-building at scale, advancing the frontier of 3D generative AI for applications in gaming, simulation, and immersive social environments.