生成AI 第2幕：テスト時スケーリングが認知エンジニアリングを推進する

要旨

第一世代の大規模言語モデル——生成AIの「第一幕」（2020-2023）と呼べるもの——は、膨大なパラメータとデータのスケーリングを通じて驚異的な成功を収めましたが、知識の遅延、浅い推論、制約された認知プロセスといった根本的な限界も露呈しました。この時代において、プロンプトエンジニアリングはAIとの主要なインターフェースとして登場し、自然言語を通じた対話レベルのコミュニケーションを可能にしました。現在、私たちは「第二幕」（2024年以降）の出現を目撃しています。ここでは、モデルが（潜在空間における）知識検索システムから、テストタイムスケーリング技術を介した思考構築エンジンへと移行しつつあります。この新しいパラダイムは、言語ベースの思考を通じてAIとのマインドレベルの接続を確立します。本論文では、認知エンジニアリングの概念的基盤を明確にし、なぜこの瞬間がその発展にとって重要なのかを説明します。包括的なチュートリアルと最適化された実装を通じて、これらの先進的なアプローチを体系的に分解し、認知エンジニアリングへのアクセスを民主化し、すべての実践者がAIの第二幕に参加できるようにします。テストタイムスケーリングに関する論文の定期的に更新されるコレクションをGitHubリポジトリで提供しています：https://github.com/GAIR-NLP/cognition-engineering

English

The first generation of Large Language Models - what might be called "Act I" of generative AI (2020-2023) - achieved remarkable success through massive parameter and data scaling, yet exhibited fundamental limitations in knowledge latency, shallow reasoning, and constrained cognitive processes. During this era, prompt engineering emerged as our primary interface with AI, enabling dialogue-level communication through natural language. We now witness the emergence of "Act II" (2024-present), where models are transitioning from knowledge-retrieval systems (in latent space) to thought-construction engines through test-time scaling techniques. This new paradigm establishes a mind-level connection with AI through language-based thoughts. In this paper, we clarify the conceptual foundations of cognition engineering and explain why this moment is critical for its development. We systematically break down these advanced approaches through comprehensive tutorials and optimized implementations, democratizing access to cognition engineering and enabling every practitioner to participate in AI's second act. We provide a regularly updated collection of papers on test-time scaling in the GitHub Repository: https://github.com/GAIR-NLP/cognition-engineering

生成AI 第2幕：テスト時スケーリングが認知エンジニアリングを推進する

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

要旨

Support