WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

December 16, 2025
Authors: Wenqiang Sun, Haiyu Zhang, Haoyuan Wang, Junta Wu, Zehan Wang, Zhenwei Wang, Yunhong Wang, Jun Zhang, Tengfei Wang, Chunchao Guo
cs.AI

Abstract

This paper presents WorldPlay, a streaming video diffusion model that enables real-time, interactive world modeling with long-term geometric consistency, resolving the trade-off between speed and memory that limits current methods. WorldPlay draws power from three key innovations. 1) We use a Dual Action Representation to enable robust action control in response to the user's keyboard and mouse inputs. 2) To enforce long-term consistency, our Reconstituted Context Memory dynamically rebuilds context from past frames and uses temporal reframing to keep geometrically important but long-past frames accessible, effectively alleviating memory attenuation. 3) We also propose Context Forcing, a novel distillation method designed for memory-aware models. Aligning the memory context between the teacher and student preserves the student's capacity to use long-range information, enabling real-time speeds while preventing error drift. Taken together, these components allow WorldPlay to generate long-horizon streaming 720p video at 24 FPS with superior consistency, comparing favorably with existing techniques and showing strong generalization across diverse scenes. The project page and online demo are available at https://3d-models.hunyuan.tencent.com/world/ and https://3d.hunyuan.tencent.com/sceneTo3D.