ChatPaper.aiChatPaper

磁力市场:一个用于研究代理市场的开源环境

Magentic Marketplace: An Open-Source Environment for Studying Agentic Markets

October 27, 2025
作者: Gagan Bansal, Wenyue Hua, Zezhou Huang, Adam Fourney, Amanda Swearngin, Will Epperson, Tyler Payne, Jake M. Hofman, Brendan Lucier, Chinmay Singh, Markus Mobius, Akshay Nambi, Archana Yadav, Kevin Gao, David M. Rothschild, Aleksandrs Slivkins, Daniel G. Goldstein, Hussein Mozannar, Nicole Immorlica, Maya Murad, Matthew Vogel, Subbarao Kambhampati, Eric Horvitz, Saleema Amershi
cs.AI

摘要

随着大型语言模型智能体的发展,它们正日益代表用户在从产品发现到交易等经济决策过程中担任中介角色。这类应用虽前景可期,却也引发了关于智能体问责制与用户价值的诸多疑问。要解答这些问题,需深入理解智能体在真实市场环境中的行为模式。然而既有研究大多在受限场景中评估智能体表现,例如单任务市场(如谈判)或结构化的双智能体交互。现实市场存在本质差异:它要求智能体处理多样化的经济活动,并在由行为不透明的多智能体通过开放式对话参与的大型动态生态系统中进行协调。为弥合这一差距,我们研究由代表消费者的助手智能体与代表竞争企业的服务智能体构成的双边智能体市场。为安全研究此类交互,我们开发了Magentic-Marketplace模拟环境供助手与服务智能体运行。该环境使我们能探究关键市场动态:智能体实现的效用、行为偏差、受操纵的脆弱性以及搜索机制如何影响市场结果。实验表明,前沿模型虽能趋近最优福利——但仅限于理想搜索条件下。随着规模扩大,性能急剧下降,且所有模型均表现出严重的一提案偏好,使响应速度相较质量产生10-30倍的优势。这些发现揭示了不同市场条件下行为模式的涌现机制,为设计公平高效的智能体市场提供了重要参考。
English
As LLM agents advance, they are increasingly mediating economic decisions, ranging from product discovery to transactions, on behalf of users. Such applications promise benefits but also raise many questions about agent accountability and value for users. Addressing these questions requires understanding how agents behave in realistic market conditions. However, previous research has largely evaluated agents in constrained settings, such as single-task marketplaces (e.g., negotiation) or structured two-agent interactions. Real-world markets are fundamentally different: they require agents to handle diverse economic activities and coordinate within large, dynamic ecosystems where multiple agents with opaque behaviors may engage in open-ended dialogues. To bridge this gap, we investigate two-sided agentic marketplaces where Assistant agents represent consumers and Service agents represent competing businesses. To study these interactions safely, we develop Magentic-Marketplace-- a simulated environment where Assistants and Services can operate. This environment enables us to study key market dynamics: the utility agents achieve, behavioral biases, vulnerability to manipulation, and how search mechanisms shape market outcomes. Our experiments show that frontier models can approach optimal welfare-- but only under ideal search conditions. Performance degrades sharply with scale, and all models exhibit severe first-proposal bias, creating 10-30x advantages for response speed over quality. These findings reveal how behaviors emerge across market conditions, informing the design of fair and efficient agentic marketplaces.
PDF92December 2, 2025