에이전트 AI를 위한 히치하이커 가이드: 기초부터 시스템까지

초록

히치하이커스 가이드 투 에이전틱 AI는 자율적 AI 시스템 구축을 위한 포괄적인 실무자 참고서다. 이 책은 단일 계층에 국한되지 않고 파이프라인의 모든 계층을 이해해야 훌륭한 에이전트 시스템을 구축할 수 있다는 핵심 주제를 중심으로, 첫 번째 원칙부터 프로덕션 배포까지 전체 스택을 다룬다. 책은 LLM 기반층(트랜스포머 아키텍처, GPU 시스템, 훈련 및 미세 조정(SFT, LoRA, MoE), 모델 압축, 추론 최적화)을 주요 초점이 아닌 필수적인 기반으로 다루며 시작한다. 이어서 정렬 및 추론 계층(인간 피드백 기반 강화학습(RLHF), PPO, DPO와 그 변형, GRPO, 보상 모델링, 사고 사슬 및 테스트 시간 스케일링을 포함한 대규모 추론 모델을 위한 강화학습)을 발전시킨다. 후반부는 본격적인 에이전틱 AI에 할애된다. 주제로는 에이전트 훈련 및 궤적 기반 강화학습, 검색 증강 생성(RAG 및 에이전틱 RAG), 메모리 시스템(맥락 내, 외부, 에피소드, 의미 메모리), 에이전트 하네스 설계 및 맥락 관리, 에이전트 설계 패턴의 분류 체계가 포함된다. 에이전트 간 조정은 심층적으로 다루어진다: 모델 컨텍스트 프로토콜(MCP), 에이전트 스킬 및 도구 사용, 에이전트 간(A2A) 통신 프로토콜, 그리고 중앙 집중형, 분산형, 계층형 토폴로지를 아우르는 다중 에이전트 아키텍처. 책은 에이전트 개발 프레임워크, 에이전틱 UI 설계, 에이전틱 작업을 위한 평가 방법론, 프로덕션 배포로 마무리된다. 각 장은 엄격한 이론적 기초와 구현 가이드, 코드 예제, 주요 문헌 참조를 함께 제공한다.

English

The Hitchhiker's Guide to Agentic AI is a comprehensive practitioner's reference for building autonomous AI systems. The book covers the full stack from first principles to production deployment, organized around a central thesis: building great agentic systems requires understanding every layer of the pipeline, not just one. The book opens with the LLM substrate -- transformer architecture, GPU systems, training and fine-tuning (SFT,LoRA, MoE), model compression, and inference optimization -- treated as essential foundations rather than the primary focus. It then develops the alignment and reasoning layer: reinforcement learning from human feedback (RLHF), PPO, DPO and its variants, GRPO, reward modeling, and RL for large reasoning models including chain-of-thought and test-time scaling. The second half is devoted to agentic AI proper. Topics include agentic training and trajectory-based RL, retrieval-augmented generation (RAG and Agentic RAG), memory systems (in-context, external, episodic, and semantic), agent harness design and context management, and a taxonomy of agent design patterns. Inter-agent coordination is covered in depth: the Model Context Protocol (MCP), agent skills and tool use, the Agent-to-Agent (A2A) communication protocol, and multi-agent architectures spanning centralized, decentralized, and hierarchical topologies. The book concludes with agent development frameworks, agentic UI design, evaluation methodology for agentic tasks, and production deployment. Each chapter pairs rigorous theoretical foundations with implementation guidance, code examples, and references to the primary literature.