De Hitchhiker's Guide naar Agentische AI: Van fundamenten tot systemen

Samenvatting

De Lifter's Gids voor Agentische AI is een uitgebreid naslagwerk voor praktijkmensen die autonome AI-systemen willen bouwen. Het boek behandelt de volledige stapeling van basisprincipes tot productie-implementatie, georganiseerd rond een centrale stelling: het bouwen van goede agentische systemen vereist inzicht in elke laag van de pijplijn, niet slechts één. Het boek opent met het LLM-substraat – transformatorarchitectuur, GPU-systemen, training en finetuning (SFT, LoRA, MoE), modelcompressie en inferentieoptimalisatie – behandeld als essentiële fundamenten in plaats van het primaire focuspunt. Vervolgens wordt de laag van afstemming en redenering ontwikkeld: versterkend leren van menselijke feedback (RLHF), PPO, DPO en zijn varianten, GRPO, beloningsmodellering en RL voor grote redeneermodellen, inclusief chain-of-thought en test-tijdsschaling. De tweede helft is gewijd aan agentische AI als zodanig. Onderwerpen zijn onder meer agentische training en trajectgebaseerd RL, retrieval-verrijkte generatie (RAG en Agentische RAG), geheugensystemen (in-context, extern, episodisch en semantisch), ontwerp van agentharnassen en contextbeheer, en een taxonomie van agentontwerppatronen. Inter-agentcoördinatie wordt diepgaand behandeld: het Model Context Protocol (MCP), agentvaardigheden en toolgebruik, het Agent-naar-Agent (A2A)-communicatieprotocol, en multi-agentarchitecturen die centrale, gedecentraliseerde en hiërarchische topologieën omvatten. Het boek sluit af met agentontwikkelingsframeworks, agentische UI-ontwerp, evaluatiemethodologie voor agentische taken en productie-implementatie. Elk hoofdstuk combineert rigoureuze theoretische fundamenten met implementatierichtlijnen, codevoorbeelden en verwijzingen naar de primaire literatuur.

English

The Hitchhiker's Guide to Agentic AI is a comprehensive practitioner's reference for building autonomous AI systems. The book covers the full stack from first principles to production deployment, organized around a central thesis: building great agentic systems requires understanding every layer of the pipeline, not just one. The book opens with the LLM substrate -- transformer architecture, GPU systems, training and fine-tuning (SFT,LoRA, MoE), model compression, and inference optimization -- treated as essential foundations rather than the primary focus. It then develops the alignment and reasoning layer: reinforcement learning from human feedback (RLHF), PPO, DPO and its variants, GRPO, reward modeling, and RL for large reasoning models including chain-of-thought and test-time scaling. The second half is devoted to agentic AI proper. Topics include agentic training and trajectory-based RL, retrieval-augmented generation (RAG and Agentic RAG), memory systems (in-context, external, episodic, and semantic), agent harness design and context management, and a taxonomy of agent design patterns. Inter-agent coordination is covered in depth: the Model Context Protocol (MCP), agent skills and tool use, the Agent-to-Agent (A2A) communication protocol, and multi-agent architectures spanning centralized, decentralized, and hierarchical topologies. The book concludes with agent development frameworks, agentic UI design, evaluation methodology for agentic tasks, and production deployment. Each chapter pairs rigorous theoretical foundations with implementation guidance, code examples, and references to the primary literature.