InfoAgent : Faire progresser les agents autonomes de recherche d'information

papers.abstract

La construction d'agents de grands modèles de langage qui étendent leurs capacités en interagissant avec des outils externes représente une nouvelle frontière dans la recherche et les applications de l'IA. Dans cet article, nous présentons InfoAgent, un agent de recherche approfondie alimenté par un pipeline innovant de synthèse de données et orchestré par des outils de recherche web. Pour construire des requêtes complexes et difficiles à trouver, nous élaborons des arbres d'entités et appliquons un échantillonnage de sous-arbres avec floutage d'entités afin d'augmenter systématiquement la difficulté des questions. Contrairement aux travaux antérieurs qui s'appuient fortement sur des outils de recherche commerciaux, nous développons une infrastructure de recherche auto-hébergée dédiée, améliorant la transparence des environnements des agents et facilitant l'avancement ultérieur de leurs capacités. Nous évaluons l'efficacité de notre pipeline de données en mesurant le nombre moyen d'appels d'outils nécessaires pour répondre correctement à une question, et montrons également que notre agent obtient de meilleures performances lorsqu'il est équipé de nos outils. Notre InfoAgent est post-entraîné à partir de Qwen3-14B en utilisant une recette en deux étapes : un fine-tuning supervisé à froid pour instiller des comportements de recherche à long terme, suivi d'un apprentissage par renforcement qui améliore significativement l'utilisation des outils pilotée par le raisonnement. Avec nos méthodes, InfoAgent atteint une précision de 15,3 % sur BrowseComp, 29,2 % sur BrowseComp-ZH et 40,4 % sur Xbench-DS, surpassant les agents de recherche approfondie open-source précédents tels que WebSailor-72B et DeepDive-32B.

English

Building Large Language Model agents that expand their capabilities by interacting with external tools represents a new frontier in AI research and applications. In this paper, we introduce InfoAgent, a deep research agent powered by an innovative data synthesis pipeline and orchestrated web search tools. To construct challenging, hard-to-find queries,we build entity trees and apply sub-tree sampling with entity fuzzification to systematically increase question difficulty. Unlike prior work that relies heavily on commercial search tools, we develop a dedicated self-hosted search infrastructure, enhancing transparency of agent environments and facilitating further advancement of agent capacity. We evaluate the effectiveness of our data pipeline by measuring the average number of tool calls required to correctly answer a question, and also show that our agent yields better performance when equipped with our tools. Our InfoAgent is post-trained from Qwen3-14B using a two-stage recipe: cold-start supervised finetuning to instill long-horizon search behaviors, followed by reinforcement learning which significantly improves reasoning-driven tool use. With our methods, InfoAgent achieves 15.3\% accuracy on BrowseComp, 29.2\% on BrowseComp-ZH, and 40.4\% on Xbench-DS, outperforming prior open-source deep research agents such as WebSailor-72B and DeepDive-32B.

InfoAgent : Faire progresser les agents autonomes de recherche d'information

InfoAgent: Advancing Autonomous Information-Seeking Agents

papers.abstract

Support