8개 토큰으로 구성하는 계획: 잠재 세계 모델을 위한 컴팩트 이산 토큰화 기법

초록

월드 모델은 행동 또는 지시에 조건부된 환경 역학을 시뮬레이션하는 강력한 프레임워크를 제공하여 행동 계획이나 정책 학습과 같은 하위 작업을 가능하게 합니다. 최근 접근법들은 월드 모델을 학습된 시뮬레이터로 활용하고 있지만, 의사 결정 시점 계획에의 적용은 실시간 제어에 있어 여전히 계산 비용이 매우 큽니다. 핵심 병목 현상은 잠재 표현에 있습니다: 기존 토크나이저는 각 관측을 수백 개의 토큰으로 인코딩하여 계획 속도를 느리게 하고 자원 소모를 크게 만듭니다. 이를 해결하기 위해 우리는 각 관측을 최소 8개의 토큰으로 압축하는 이산 토크나이저인 CompACT를 제안합니다. 이를 통해 계획에 필수적인 정보를 보존하면서 계산 비용을 획기적으로 줄입니다. CompACT 토크나이저를 활용한 행동 조건부 월드 모델은 기존 대비 수 배 빠른 계획 속도로 경쟁력 있는 계획 성능을 달성하여, 월드 모델의 실세계 배치를 위한 실용적인 단계를 제시합니다.

English

World models provide a powerful framework for simulating environment dynamics conditioned on actions or instructions, enabling downstream tasks such as action planning or policy learning. Recent approaches leverage world models as learned simulators, but its application to decision-time planning remains computationally prohibitive for real-time control. A key bottleneck lies in latent representations: conventional tokenizers encode each observation into hundreds of tokens, making planning both slow and resource-intensive. To address this, we propose CompACT, a discrete tokenizer that compresses each observation into as few as 8 tokens, drastically reducing computational cost while preserving essential information for planning. An action-conditioned world model that occupies CompACT tokenizer achieves competitive planning performance with orders-of-magnitude faster planning, offering a practical step toward real-world deployment of world models.

8개 토큰으로 구성하는 계획: 잠재 세계 모델을 위한 컴팩트 이산 토큰화 기법

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

초록

Support