エンタープライズシステムは学習された世界モデルを必要とするか？ ─ ダイナミクスを推論するためのコンテキストの重要性

要旨

世界モデルは、エージェントが環境のダイナミクスを内面化することで、自身の行動の効果を予測することを可能にする。しかし、エンタープライズシステムにおいては、これらのダイナミクスは、デプロイメントごとに異なり時間とともに進化するテナント固有のビジネスロジックによって定義されることが多く、そのため過去の遷移で学習されたモデルはデプロイメントシフトに対して脆弱になる。我々は、世界モデルの文献が扱ってこなかった問いを提起する。すなわち、推論時にルールを読み取れる場合、エージェントは依然としてそれらを学習する必要があるのか？我々は、遷移ダイナミクスが設定可能かつ読み取り可能な環境において、ランタイム発見が、予測をアクティブなシステムインスタンスに基づかせることでオフライン学習を補完する、と主張し、実証する。我々は、エンタープライズ発見エージェントを提案する。これは、内面化された表現のみに依存するのではなく、システムの設定を読み取ることで、ランタイムに関連する遷移ダイナミクスを回復する。我々は、CascadeBenchを紹介する。これは、多様な合成環境上でWorld of Workflowsの評価方法論を採用した、エンタープライズカスケード予測のための推論重視ベンチマークである。これをデプロイメントシフト評価と併用し、オフライン学習された世界モデルは分布内では良好に動作するが、ダイナミクスの変化に伴い性能が低下する一方、発見ベースのエージェントは予測を現在のインスタンスに基づかせることでシフト下でもより堅牢であることを示す。我々の発見は、設定可能なエンタープライズ環境において、エージェントは固定された内面化ダイナミクスのみに依存すべきではなく、ランタイムに関連する遷移ロジックを発見するメカニズムを組み込むべきであることを示唆する。

English

World models enable agents to anticipate the effects of their actions by internalizing environment dynamics. In enterprise systems, however, these dynamics are often defined by tenant-specific business logic that varies across deployments and evolves over time, making models trained on historical transitions brittle under deployment shift. We ask a question the world-models literature has not addressed: when the rules can be read at inference time, does an agent still need to learn them? We argue, and demonstrate empirically, that in settings where transition dynamics are configurable and readable, runtime discovery complements offline training by grounding predictions in the active system instance. We propose enterprise discovery agents, which recover relevant transition dynamics at runtime by reading the system's configuration rather than relying solely on internalized representations. We introduce CascadeBench, a reasoning-focused benchmark for enterprise cascade prediction that adopts the evaluation methodology of World of Workflows on diverse synthetic environments, and use it together with deployment-shift evaluation to show that offline-trained world models can perform well in-distribution but degrade as dynamics change, whereas discovery-based agents are more robust under shift by grounding their predictions in the current instance. Our findings suggest that, in configurable enterprise environments, agents should not rely solely on fixed internalized dynamics, but should incorporate mechanisms for discovering relevant transition logic at runtime.

エンタープライズシステムは学習された世界モデルを必要とするか？ ─ ダイナミクスを推論するためのコンテキストの重要性

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

要旨

Support