构建面向智能体的网络,而非为网络构建智能体。
Build the web for agents, not agents for the web
June 12, 2025
作者: Xing Han Lù, Gaurav Kamath, Marius Mosbach, Siva Reddy
cs.AI
摘要
近期,大型语言模型(LLMs)及其多模态对应物的进展,极大地激发了开发网络代理——即能在网络环境中自主导航并完成任务的人工智能系统——的兴趣。尽管在自动化复杂网络交互方面展现出巨大潜力,当前方法因人类设计界面与LLM能力之间的根本性不匹配而面临重大挑战。现有方法在处理网络输入的内在复杂性时显得力不从心,无论是解析庞大的DOM树、依赖附加信息增强的截图,还是完全通过API交互绕过用户界面。本立场文件主张网络代理研究应进行范式转变:与其迫使网络代理适应为人类设计的界面,不如开发一种专门优化代理能力的新交互范式。为此,我们引入了“代理式网络界面”(Agentic Web Interface, AWI)的概念,这是一种专为代理导航网站而设计的界面。我们确立了AWI设计的六项指导原则,强调安全性、效率及标准化,以兼顾所有主要利益相关者的利益。这一重构旨在克服现有界面的根本限制,为更高效、可靠且透明的网络代理设计铺平道路,这将是整个机器学习社区共同参与的合作努力。
English
Recent advancements in Large Language Models (LLMs) and multimodal
counterparts have spurred significant interest in developing web agents -- AI
systems capable of autonomously navigating and completing tasks within web
environments. While holding tremendous promise for automating complex web
interactions, current approaches face substantial challenges due to the
fundamental mismatch between human-designed interfaces and LLM capabilities.
Current methods struggle with the inherent complexity of web inputs, whether
processing massive DOM trees, relying on screenshots augmented with additional
information, or bypassing the user interface entirely through API interactions.
This position paper advocates for a paradigm shift in web agent research:
rather than forcing web agents to adapt to interfaces designed for humans, we
should develop a new interaction paradigm specifically optimized for agentic
capabilities. To this end, we introduce the concept of an Agentic Web Interface
(AWI), an interface specifically designed for agents to navigate a website. We
establish six guiding principles for AWI design, emphasizing safety,
efficiency, and standardization, to account for the interests of all primary
stakeholders. This reframing aims to overcome fundamental limitations of
existing interfaces, paving the way for more efficient, reliable, and
transparent web agent design, which will be a collaborative effort involving
the broader ML community.