에이전트 기반 검색을 위한 정보 검색 학습

초록

에이전트 기반 검색은 최근 복잡한 질문을 해결하기 위해 에이전트가 다단계 추론과 온디맨드 정보 검색을 교차하여 수행하는 강력한 패러다임으로 부상했습니다. 그러나 그 성공에도 불구하고, 에이전트 기반 검색을 위한 검색기를 어떻게 설계할지에 대한 연구는 여전히 미흡한 실정입니다. 기존 검색 에이전트는 일반적으로 유사도 기반 검색기에 의존하는데, 유사한 문단이 최종 답변 생성에 항상 유용한 것은 아닙니다. 본 논문에서는 에이전트 기반 검색에 특화된 새로운 검색기 훈련 프레임워크를 제안합니다. 단일 턴 검색 증강 생성(RAG)만을 위해 설계되어 지역적 문단 유용성에만 의존하는 기존 검색기와 달리, 본 연구에서는 다중 턴 에이전트 기반 검색에서 문단의 유용성을 측정하기 위해 지역적 질의-문단 관련성과 전역적 답변 정확성을 모두 활용합니다. 또한 검색 에이전트와 검색기가 양방향 및 반복적으로 최적화되는 반복적 훈련 전략을 도입합니다. 고정된 질문으로 단 한 번만 훈련되는 RAG 검색기와 달리, 우리의 검색기는 에이전트로부터 진화하고 더 높은 품질의 질의를 지속적으로 활용하여 개선됩니다. 7개의 단일 홉 및 다중 홉 질의응답 벤치마크에 대한 광범위한 실험을 통해 로 명명된 우리의 검색기가 다양한 검색 에이전트에서 강력한 베이스라인을 지속적으로 능가함을 입증합니다. 우리의 코드는 https://github.com/8421BCD/Agentic-R에서 확인할 수 있습니다.

English

Agentic search has recently emerged as a powerful paradigm, where an agent interleaves multi-step reasoning with on-demand retrieval to solve complex questions. Despite its success, how to design a retriever for agentic search remains largely underexplored. Existing search agents typically rely on similarity-based retrievers, while similar passages are not always useful for final answer generation. In this paper, we propose a novel retriever training framework tailored for agentic search. Unlike retrievers designed for single-turn retrieval-augmented generation (RAG) that only rely on local passage utility, we propose to use both local query-passage relevance and global answer correctness to measure passage utility in a multi-turn agentic search. We further introduce an iterative training strategy, where the search agent and the retriever are optimized bidirectionally and iteratively. Different from RAG retrievers that are only trained once with fixed questions, our retriever is continuously improved using evolving and higher-quality queries from the agent. Extensive experiments on seven single-hop and multi-hop QA benchmarks demonstrate that our retriever, termed , consistently outperforms strong baselines across different search agents. Our codes are available at: https://github.com/8421BCD/Agentic-R.

에이전트 기반 검색을 위한 정보 검색 학습

Agentic-R: Learning to Retrieve for Agentic Search

초록

Support