エージェンティックAIのヒッチハイク・ガイド：基礎からシステムへ

要旨

『エージェンティックAIのヒッチハイカー・ガイド』は、自律型AIシステムを構築するための実践的なリファレンスである。本書は第一原理から本番環境へのデプロイに至るまでのフルスタックを、中心的なテーゼ、すなわち優れたエージェンティックシステムを構築するにはパイプラインの個別の層だけでなくすべての層を理解することが必要である、という視点に基づいて構成されている。序盤ではLLM基盤――トランスフォーマーアーキテクチャ、GPUシステム、学習とファインチューニング（SFT、LoRA、MoE）、モデル圧縮、推論最適化――を、主たる焦点ではなく必須の基礎として扱う。続いてアライメントと推論のレイヤーを展開する。すなわち、人間からのフィードバックによる強化学習（RLHF）、PPO、DPOおよびその派生、GRPO、報酬モデリング、そしてチェーン・オブ・ソートやテスト時スケーリングを含む大規模推論モデルのための強化学習である。後半はエージェンティックAIそのものに充てられる。トピックとしては、エージェンティック学習と軌跡ベースのRL、検索拡張生成（RAGおよびAgentic RAG）、メモリシステム（インコンテキスト、外部、エピソディック、セマンティック）、エージェントハーネスの設計とコンテキスト管理、エージェント設計パターンの分類が含まれる。エージェント間の協調については詳細に扱われる。モデルコンテキストプロトコル（MCP）、エージェントのスキルとツール使用、エージェント間（A2A）通信プロトコル、そして集中型、分散型、階層型トポロジを網羅するマルチエージェントアーキテクチャである。本書は、エージェント開発フレームワーク、エージェンティックUI設計、エージェンティックタスクの評価手法、そして本番運用で締めくくられる。各章では、厳密な理論的基盤と実装ガイダンス、コード例、一次文献への参照を組み合わせて提供する。

English

The Hitchhiker's Guide to Agentic AI is a comprehensive practitioner's reference for building autonomous AI systems. The book covers the full stack from first principles to production deployment, organized around a central thesis: building great agentic systems requires understanding every layer of the pipeline, not just one. The book opens with the LLM substrate -- transformer architecture, GPU systems, training and fine-tuning (SFT,LoRA, MoE), model compression, and inference optimization -- treated as essential foundations rather than the primary focus. It then develops the alignment and reasoning layer: reinforcement learning from human feedback (RLHF), PPO, DPO and its variants, GRPO, reward modeling, and RL for large reasoning models including chain-of-thought and test-time scaling. The second half is devoted to agentic AI proper. Topics include agentic training and trajectory-based RL, retrieval-augmented generation (RAG and Agentic RAG), memory systems (in-context, external, episodic, and semantic), agent harness design and context management, and a taxonomy of agent design patterns. Inter-agent coordination is covered in depth: the Model Context Protocol (MCP), agent skills and tool use, the Agent-to-Agent (A2A) communication protocol, and multi-agent architectures spanning centralized, decentralized, and hierarchical topologies. The book concludes with agent development frameworks, agentic UI design, evaluation methodology for agentic tasks, and production deployment. Each chapter pairs rigorous theoretical foundations with implementation guidance, code examples, and references to the primary literature.