optimize_anything: あらゆるテキストパラメータを最適化するためのユニバーサルAPI

要旨

単一のLLMベースの最適化システムは、根本的に異なる領域において専門ツールに匹敵できるのか。本稿では、最適化問題をスコア関数で評価されるテキスト成果物の改善として定式化した場合、単一のAIベース最適化システム（単一タスク探索、問題間転移を伴うマルチタスク探索、未見入力への一般化をサポート）が、6つの多様なタスクにわたって最先端の成果を達成することを示す。このシステムは、ジェミニ・フラッシュのARC-AGI精度を32.5%から89.5%へとほぼ3倍に向上させるエージェントアーキテクチャを発見し、クラウドコストを40%削減するスケジューリングアルゴリズムを見つけ出し、87%がPyTorchと同等かそれ以上の性能を発揮するCUDAカーネルを生成し、AlphaEvolveが報告した円充填解（n=26）を上回る。3つの領域にわたるアブレーション解析から、実行可能なサイド情報がスコアのみのフィードバックよりも速い収束と実質的に高い最終スコアをもたらすこと、また、マルチタスク探索は、問題間転移を通じて同一のタスク当たり予算を与えられた場合に独立した最適化よりも優れており、その利点は関連タスクの数に応じて拡大することが明らかになった。これらの結果を総合すると、LLMベースの探索を用いたテキスト最適化は汎用問題解決パラダイムであり、従来は領域固有のアルゴリズムを必要としていたタスクを単一のフレームワークの下で統合することを初めて示す。我々は、GEPAプロジェクトの一部として、複数のバックエンドをサポートするoptimize\_anythingをhttps://github.com/gepa-ai/gepaでオープンソース公開する。

English

Can a single LLM-based optimization system match specialized tools across fundamentally different domains? We show that when optimization problems are formulated as improving a text artifact evaluated by a scoring function, a single AI-based optimization system-supporting single-task search, multi-task search with cross-problem transfer, and generalization to unseen inputs-achieves state-of-the-art results across six diverse tasks. Our system discovers agent architectures that nearly triple Gemini Flash's ARC-AGI accuracy (32.5% to 89.5%), finds scheduling algorithms that cut cloud costs by 40%, generates CUDA kernels where 87% match or beat PyTorch, and outperforms AlphaEvolve's reported circle packing solution (n=26). Ablations across three domains reveal that actionable side information yields faster convergence and substantially higher final scores than score-only feedback, and that multi-task search outperforms independent optimization given equivalent per-problem budget through cross-task transfer, with benefits scaling with the number of related tasks. Together, we show for the first time that text optimization with LLM-based search is a general-purpose problem-solving paradigm, unifying tasks traditionally requiring domain-specific algorithms under a single framework. We open-source optimize\_anything with support for multiple backends as part of the GEPA project at https://github.com/gepa-ai/gepa .