RankEvolve：大規模言語モデル駆動の進化による検索アルゴリズム発見の自動化

要旨

BM25やディリクレ平滑化を伴うクエリ尤度といった検索アルゴリズムは、依然として強力かつ効率的な第一段階のランキング手法であるが、その改善は主にパラメータチューニングと人間の直感に依存してきた。本研究では、評価器と進化的探索によって導かれる大規模言語モデルが、改良された語彙検索アルゴリズムを自動的に発見できるかどうかを調査する。我々は、AlphaEvolveに基づくプログラム進化手法であるRankEvolveを提案する。この手法では、候補となるランキングアルゴリズムが実行可能コードとして表現され、BEIRとBRIGHTに由来する12のIRデータセットにおける検索性能に基づいて、反復的に変異、組み換え、選択が行われる。RankEvolveは、BM25とディリクレ平滑化を伴うクエリ尤度という2つのシードプログラムから開始する。進化によって生成されたアルゴリズムは新規性と有効性を備え、BEIRおよびBRIGHTベンチマーク全体、ならびにTREC DL 19および20への良好な転移性能を示す。我々の結果は、評価器によって導かれるLLMプログラム進化が、新規なランキングアルゴリズムを自動発見するための現実的な道筋であることを示唆している。

English

Retrieval algorithms like BM25 and query likelihood with Dirichlet smoothing remain strong and efficient first-stage rankers, yet improvements have mostly relied on parameter tuning and human intuition. We investigate whether a large language model, guided by an evaluator and evolutionary search, can automatically discover improved lexical retrieval algorithms. We introduce RankEvolve, a program evolution setup based on AlphaEvolve, in which candidate ranking algorithms are represented as executable code and iteratively mutated, recombined, and selected based on retrieval performance across 12 IR datasets from BEIR and BRIGHT. RankEvolve starts from two seed programs: BM25 and query likelihood with Dirichlet smoothing. The evolved algorithms are novel, effective, and show promising transfer to the full BEIR and BRIGHT benchmarks as well as TREC DL 19 and 20. Our results suggest that evaluator-guided LLM program evolution is a practical path towards automatic discovery of novel ranking algorithms.

RankEvolve：大規模言語モデル駆動の進化による検索アルゴリズム発見の自動化

RankEvolve: Automating the Discovery of Retrieval Algorithms via LLM-Driven Evolution

要旨

Support