SPAR: 향상된 학술 검색을 위한 LLM 기반 에이전트를 활용한 학술 논문 검색

초록

대규모 언어 모델(LLMs)의 최근 발전은 학술 문헌 검색에 새로운 기회를 열어주었다. 그러나 기존 시스템은 주로 경직된 파이프라인에 의존하며 제한된 추론 능력을 보인다. 본 연구에서는 보다 유연하고 효과적인 검색을 가능하게 하는 RefChain 기반 질의 분해와 질의 진화를 통합한 다중 에이전트 프레임워크인 SPAR를 소개한다. 체계적인 평가를 위해 전문가가 관련성 레이블을 부여한 도전적인 벤치마크인 SPARBench도 구축하였다. 실험 결과, SPAR는 강력한 베이스라인 대비 AutoScholar에서 최대 +56% F1, SPARBench에서 +23% F1의 성능 향상을 달성하며 크게 우수한 성능을 보였다. SPAR와 SPARBench는 학술 검색 연구를 발전시키기 위한 확장 가능하고 해석 가능하며 고성능의 기반을 제공한다. 코드와 데이터는 https://github.com/xiaofengShi/SPAR에서 확인할 수 있다.

English

Recent advances in large language models (LLMs) have opened new opportunities for academic literature retrieval. However, existing systems often rely on rigid pipelines and exhibit limited reasoning capabilities. We introduce SPAR, a multi-agent framework that incorporates RefChain-based query decomposition and query evolution to enable more flexible and effective search. To facilitate systematic evaluation, we also construct SPARBench, a challenging benchmark with expert-annotated relevance labels. Experimental results demonstrate that SPAR substantially outperforms strong baselines, achieving up to +56% F1 on AutoScholar and +23% F1 on SPARBench over the best-performing baseline. Together, SPAR and SPARBench provide a scalable, interpretable, and high-performing foundation for advancing research in scholarly retrieval. Code and data will be available at: https://github.com/xiaofengShi/SPAR

SPAR: 향상된 학술 검색을 위한 LLM 기반 에이전트를 활용한 학술 논문 검색

SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

초록

Support