MCompassRAG: トピックメタデータをセマンティックコンパスとした段落レベル検索

要旨

検索拡張生成（RAG）システムは、文書の分割方法と検索方法に決定的に依存している。細粒度のチャンクは検索精度を向上させる一方、探索空間が拡大しレイテンシとコストが増大する。大きなチャンクは候補数を減らすが、各チャンクに複数のトピックが混在し、より多くのセマンティックノイズが生じるため、密な類似度の信頼性が低下する。このトレードオフは、大規模で異種なコーパスに対して高速かつ高精度な検索が求められる深層研究タスクにおいて、特に大きな制約となる。本稿では、関連エビデンスを選択するためのセマンティックコンパスとしてトピックレベルの信号を利用する、メタデータ誘導型検索フレームワークMCompassRAGを提案する。MCompassRAGは、クエリとノイズの多いチャンク埋め込み間のコサイン類似度のみに依存するのではなく、同一埋め込み空間内でチャンク表現をトピックメタデータで強化し、LLM教師蒸留を通じて軽量な検索器を訓練する。推論時には、追加のLLM呼び出しを必要とせずにトピック認識検索を実行し、効率とエビデンス品質の両方を向上させる。6つの複雑な検索ベンチマークにおいて、MCompassRAGは、最も強力な効率的RAGベースラインと比較して、平均8.24%の情報効率（IE）の向上を達成し、レイテンシは5分の1以下である。コードはhttps://github.com/AmirAbaskohi/MCompassRAGで公開されている。

English

Retrieval-augmented generation (RAG) systems depend critically on how documents are chunked and searched. Fine-grained chunks can improve retrieval precision but expand the search space, increasing latency and cost; larger chunks reduce the number of candidates but make dense similarity less reliable, as the representation for each chunk mixes multiple topics and introduces more semantic noise. This trade-off becomes especially limiting in deep research tasks, where retrieval must be both fast and precise across large, heterogeneous corpora. We introduce MCompassRAG, a metadata-guided retrieval framework that uses topic-level signals as a semantic compass for selecting relevant evidence. Instead of relying only on cosine similarity between queries and noisy chunk embeddings, MCompassRAG enriches chunk representations with topic metadata in the same embedding space and trains a lightweight retriever through LLM-teacher distillation. At inference time, MCompassRAG performs topic-aware retrieval without additional LLM calls, improving both efficiency and evidence quality. Across six complex retrieval benchmarks, MCompassRAG improves information efficiency (IE) by 8.24% on average with over 5 times lower latency than the strongest efficient RAG baselines. Code is available on https://github.com/AmirAbaskohi/MCompassRAG.