대규모 언어 모델을 활용한 Vibe 코딩에 관한 연구 동향

초록

대규모 언어 모델(LLM)의 발전은 코드 생성 지원에서 자율 코딩 에이전트로의 패러다임 전환을 촉진하며, 개발자가 코드를 한 줄씩 이해하기보다는 AI가 생성한 구현 결과를 관찰하여 검증하는 "바이브 코딩(Vibe Coding)"이라는 새로운 개발 방법론을 가능하게 했습니다. 그러나 이러한 혁신적인 패러다임의 효과성은 아직 충분히 탐구되지 않았으며, 실증 연구에서는 예상치 못한 생산성 저하와 인간-AI 협업의 근본적인 문제점들이 드러나고 있습니다. 이러한 격차를 해결하기 위해, 본 설문 조사는 대규모 언어 모델을 활용한 바이브 코딩에 대한 최초의 포괄적이고 체계적인 리뷰를 제공하며, 이 혁신적인 개발 접근법에 대한 이론적 기반과 실용적 프레임워크를 확립합니다. 1000편 이상의 연구 논문을 체계적으로 분석하여, 코딩용 LLM, LLM 기반 코딩 에이전트, 코딩 에이전트 개발 환경, 피드백 메커니즘 등 바이브 코딩 생태계의 핵심 인프라 요소들을 조사합니다. 먼저, 인간 개발자, 소프트웨어 프로젝트, 코딩 에이전트 간의 동적 삼자 관계를 포착하는 제약 마르코프 결정 프로세스(Constrained Markov Decision Process)를 통해 바이브 코딩을 공식화함으로써 이를 하나의 공식 학문 분야로 소개합니다. 이 이론적 기반을 바탕으로, 기존 실무를 무제약 자동화, 반복적 대화형 협업, 계획 주도형, 테스트 주도형, 컨텍스트 강화 모델 등 5가지 독특한 개발 모델로 종합하여 이 분야의 최초의 포괄적 분류체계를 제공합니다. 특히, 우리의 분석은 성공적인 바이브 코딩이 단순히 에이전트의 능력에 의존하는 것이 아니라 체계적인 컨텍스트 엔지니어링, 잘 구축된 개발 환경, 그리고 인간-에이전트 협업 개발 모델에 달려 있음을 보여줍니다.

English

The advancement of large language models (LLMs) has catalyzed a paradigm shift from code generation assistance to autonomous coding agents, enabling a novel development methodology termed "Vibe Coding" where developers validate AI-generated implementations through outcome observation rather than line-by-line code comprehension. Despite its transformative potential, the effectiveness of this emergent paradigm remains under-explored, with empirical evidence revealing unexpected productivity losses and fundamental challenges in human-AI collaboration. To address this gap, this survey provides the first comprehensive and systematic review of Vibe Coding with large language models, establishing both theoretical foundations and practical frameworks for this transformative development approach. Drawing from systematic analysis of over 1000 research papers, we survey the entire vibe coding ecosystem, examining critical infrastructure components including LLMs for coding, LLM-based coding agent, development environment of coding agent, and feedback mechanisms. We first introduce Vibe Coding as a formal discipline by formalizing it through a Constrained Markov Decision Process that captures the dynamic triadic relationship among human developers, software projects, and coding agents. Building upon this theoretical foundation, we then synthesize existing practices into five distinct development models: Unconstrained Automation, Iterative Conversational Collaboration, Planning-Driven, Test-Driven, and Context-Enhanced Models, thus providing the first comprehensive taxonomy in this domain. Critically, our analysis reveals that successful Vibe Coding depends not merely on agent capabilities but on systematic context engineering, well-established development environments, and human-agent collaborative development models.

대규모 언어 모델을 활용한 Vibe 코딩에 관한 연구 동향

A Survey of Vibe Coding with Large Language Models

초록

Support