大規模言語モデルを用いたVibeコーディングの調査

要旨

大規模言語モデル（LLM）の進化は、コード生成支援から自律的なコーディングエージェントへのパラダイムシフトを引き起こし、「Vibe Coding」と呼ばれる新しい開発方法論を可能にしました。この方法論では、開発者がAIが生成した実装を、コードを一行ずつ理解するのではなく、結果の観察を通じて検証します。この革新的なパラダイムには変革の可能性があるものの、その有効性はまだ十分に検証されておらず、実証研究では予期せぬ生産性の低下や人間とAIの協業における根本的な課題が明らかになっています。このギャップを埋めるため、本調査は大規模言語モデルを用いたVibe Codingに関する初の包括的かつ体系的なレビューを提供し、この変革的な開発アプローチの理論的基盤と実践的フレームワークを確立します。1000以上の研究論文の体系的分析に基づき、Vibe Codingのエコシステム全体を調査し、コーディング用LLM、LLMベースのコーディングエージェント、コーディングエージェントの開発環境、フィードバックメカニズムといった重要なインフラストラクチャ要素を検証します。まず、Vibe Codingを正式な学問分野として紹介し、人間の開発者、ソフトウェアプロジェクト、コーディングエージェントの間の動的な三者関係を捉える制約付きマルコフ決定過程を通じて形式化します。この理論的基盤を踏まえ、既存の実践を5つの異なる開発モデル（無制限自動化、反復的対話型協業、計画主導型、テスト駆動型、コンテキスト強化型モデル）に統合し、この領域で初の包括的な分類体系を提供します。重要な点として、我々の分析は、Vibe Codingの成功が単にエージェントの能力に依存するのではなく、体系的なコンテキストエンジニアリング、確立された開発環境、人間とエージェントの協業開発モデルに依存することを明らかにしています。

English

The advancement of large language models (LLMs) has catalyzed a paradigm shift from code generation assistance to autonomous coding agents, enabling a novel development methodology termed "Vibe Coding" where developers validate AI-generated implementations through outcome observation rather than line-by-line code comprehension. Despite its transformative potential, the effectiveness of this emergent paradigm remains under-explored, with empirical evidence revealing unexpected productivity losses and fundamental challenges in human-AI collaboration. To address this gap, this survey provides the first comprehensive and systematic review of Vibe Coding with large language models, establishing both theoretical foundations and practical frameworks for this transformative development approach. Drawing from systematic analysis of over 1000 research papers, we survey the entire vibe coding ecosystem, examining critical infrastructure components including LLMs for coding, LLM-based coding agent, development environment of coding agent, and feedback mechanisms. We first introduce Vibe Coding as a formal discipline by formalizing it through a Constrained Markov Decision Process that captures the dynamic triadic relationship among human developers, software projects, and coding agents. Building upon this theoretical foundation, we then synthesize existing practices into five distinct development models: Unconstrained Automation, Iterative Conversational Collaboration, Planning-Driven, Test-Driven, and Context-Enhanced Models, thus providing the first comprehensive taxonomy in this domain. Critically, our analysis reveals that successful Vibe Coding depends not merely on agent capabilities but on systematic context engineering, well-established development environments, and human-agent collaborative development models.

大規模言語モデルを用いたVibeコーディングの調査

A Survey of Vibe Coding with Large Language Models

要旨

Support