エージェントバザール：マルチエージェント市場における経済的整合性の実現

要旨

大規模言語モデル（LLM）を自律型経済エージェントとして展開することは、個々の能力の限界を超えたシステム的リスクをもたらす。エージェントが市場と直接相互作用するようになるにつれ、その集団行動は変動性を増幅し、大規模な欺瞞を覆い隠す可能性がある。我々は、エージェントシステムが市場の安定性と完全性を維持する能力である経済的アライメント（Economic Alignment）を評価するためのマルチエージェントシミュレーションフレームワーク、Agent Bazaar を提案する。二つの失敗モードを特定する：(1) B2C市場におけるアルゴリズム的不安定性（「クラッシュ」）—企業が価格変動性を増幅し市場が崩壊するまでに至る、(2) C2C市場におけるシビル欺瞞（「レモン市場」）—単一の欺瞞的エージェントが複数の統制された売り手のアイデンティティを操作し、不正出品で市場を氾濫させ、信頼と消費者福祉を損なう。我々は、両方のシナリオにおいて最先端およびオープンウェイトモデルを評価し、モデルが自己規制にほぼ失敗することを発見した。その失敗の深刻度は、モデルのサイズではなく、モデルごとに異なる。我々は、経済的にアライメントされたハーネスとして、安定化企業（Stabilizing Firms）と懐疑的ガーディアン（Skeptical Guardians）を提案する。これらは結果を改善するものの、より困難な市場条件下では脆弱なままである。このギャップを埋めるため、我々は適応的カリキュラムを用いたREINFORCE++によってエージェントを訓練し、評価されたすべての最先端およびオープンウェイトモデルを凌駕する9Bモデルを生み出した。さらに、安定性、完全性、福祉、収益性を集約した4成分からなるスカラー指標である経済的アライメントスコア（EAS: Economic Alignment Score）を提案し、モデル間の直接比較を可能にする。我々の結果は、経済的アライメントは一般能力と直交しており、標的型強化学習によって直接訓練可能であることを示している。

English

The deployment of Large Language Models (LLMs) as autonomous economic agents introduces systemic risks that extend beyond individual capability failures. As agents transition to directly interacting with marketplaces, their collective behavior can amplify volatility and mask deception at scale. We introduce the Agent Bazaar, a multi-agent simulation framework for evaluating Economic Alignment, the capacity of agentic systems to preserve market stability and integrity. We identify two failure modes: (1) Algorithmic Instability in a B2C market ("The Crash"), where firms amplify price volatility until the market collapses, and (2) Sybil Deception in a C2C market ("The Lemon Market"), where a single deceptive agent controlling multiple coordinated seller identities floods the market with fraudulent listings, eroding trust and consumer welfare. We evaluate frontier and open-weight models across both scenarios and find that models largely fail to self-regulate, with failure severity varying by model rather than by size. We propose economically aligned harnesses, Stabilizing Firms and Skeptical Guardians, that improve outcomes but remain fragile under harder market conditions. To close this gap, we train agents with REINFORCE++ using an adaptive curriculum, producing a 9B model that outperforms all evaluated frontier and open-weight models. We propose the Economic Alignment Score (EAS), a 4-component scalar metric aggregating stability, integrity, welfare, and profitability, enabling direct cross-model comparison. Our results show that economic alignment is orthogonal to general capability and can be directly trained with targeted RL.