Socratic-SWE: トレースから導出されたエージェントスキルに基づく自己進化型コーディングエージェント

要旨

LLM駆動型ソフトウェアエンジニアリングエージェントは、現実世界における言語モデルの能力を評価するための中心的なテストベッドとなっているが、その訓練は高品質なソフトウェアエンジニアリングタスクの入手可能性によって依然として制限されている。既存の合成データ手法は、通常、固定された変異やバグ注入手順を通じてタスクを作成するため、生成される分布はエージェント自身の弱点や訓練の進捗からほぼ独立したものとなる。本稿では、Socratic-SWEを提案する。これは、エージェントの過去の解決トレースを訓練信号の源泉として再利用する閉ループ型自己進化フレームワークである。Socratic-SWEは、トレースを単に報酬計算のための証拠として扱うのではなく、それらを反復的な失敗や効果的な修復パターンを要約した構造化されたエージェントスキルへと蒸留する。これらのスキルは、実際のリポジトリ上で対象を絞った修復タスクの生成を導く。候補タスクは実行ベースの検証によってチェックされ、ソルバー勾配アライメント報酬でスコアリングされるため、保持されるタスクは検証可能であり、かつソルバーの改善に有用である。更新されたソルバーは新たなトレースを生成し、タスクカリキュラムが連続するラウンドにわたって適応することを可能にする。SWE-bench Verified、SWE-bench Lite、SWE-bench Pro、Terminal-Bench 2.0において、Socratic-SWEは同じ計算予算のもとで自己進化ベースラインを一貫して上回り、3回の反復後にSWE-bench Verifiedで50.40%に達した。これらの結果は、解決トレースが自己進化型SWEエージェントのためのスケーラブルな基盤として機能し得ることを示唆している。

English

LLM-driven software engineering agents have become a central testbed for real-world language-model capability, yet their training remains limited by the availability of high-quality SWE tasks. Existing synthetic data methods typically create tasks through fixed mutation or bug-injection procedures, making the resulting distributions largely independent of the agent's own weaknesses and training progress. We introduce Socratic-SWE, a closed-loop self-evolution framework that reuses the agent's historical solving traces as a source of training signal. Rather than treating traces only as evidence for reward computation, Socratic-SWE distills them into structured agent skills that summarize recurring failures and effective repair patterns. These skills then guide the generation of targeted repair tasks in real repositories. Candidate tasks are checked through execution-based validation and scored with a solver-gradient alignment reward, so that the retained tasks are both verifiable and useful for improving the Solver. The updated Solver produces new traces, enabling the task curriculum to adapt over successive rounds. Across SWE-bench Verified, SWE-bench Lite, SWE-bench Pro, and Terminal-Bench 2.0, Socratic-SWE consistently improves over self-evolving baselines under the same compute budget, reaching 50.40% on SWE-bench Verified after three iterations. These results suggest that solving traces can serve as a scalable substrate for self-evolving SWE agents.