MCompassRAG: 문단 수준 검색을 위한 의미적 나침반으로서의 주제 메타데이터

초록

검색 증강 생성(RAG) 시스템은 문서가 어떻게 청크(chunk)로 분할되고 검색되는지에 크게 의존한다. 세밀한 청크는 검색 정밀도를 향상시킬 수 있지만 검색 공간을 확장하여 지연 시간과 비용을 증가시키고, 큰 청크는 후보 수를 줄이지만 각 청크의 표현이 여러 주제를 혼합하고 더 많은 의미적 잡음을 유발하므로 밀집 유사도의 신뢰성이 낮아진다. 이러한 절충은 크고 이질적인 말뭉치에서 검색이 빠르고 정밀해야 하는 심층 연구 과제에서 특히 제한적이다. 우리는 MCompassRAG를 소개한다. 이는 주제 수준의 신호를 관련 증거를 선택하기 위한 의미적 나침반으로 사용하는 메타데이터 기반 검색 프레임워크이다. MCompassRAG는 질의와 잡음이 있는 청크 임베딩 간의 코사인 유사도에만 의존하는 대신, 동일한 임베딩 공간에서 청크 표현을 주제 메타데이터로 강화하고 LLM-교사 증류(LLM-teacher distillation)를 통해 경량 검색기를 훈련시킨다. 추론 시 MCompassRAG는 추가적인 LLM 호출 없이 주제 인식 검색을 수행하여 효율성과 증거 품질을 모두 개선한다. 여섯 가지 복잡한 검색 벤치마크에서 MCompassRAG는 가장 강력한 효율적 RAG 기준선보다 5배 이상 낮은 지연 시간으로 정보 효율성(Information Efficiency, IE)을 평균 8.24% 향상시킨다. 코드는 https://github.com/AmirAbaskohi/MCompassRAG에서 확인할 수 있다.

English

Retrieval-augmented generation (RAG) systems depend critically on how documents are chunked and searched. Fine-grained chunks can improve retrieval precision but expand the search space, increasing latency and cost; larger chunks reduce the number of candidates but make dense similarity less reliable, as the representation for each chunk mixes multiple topics and introduces more semantic noise. This trade-off becomes especially limiting in deep research tasks, where retrieval must be both fast and precise across large, heterogeneous corpora. We introduce MCompassRAG, a metadata-guided retrieval framework that uses topic-level signals as a semantic compass for selecting relevant evidence. Instead of relying only on cosine similarity between queries and noisy chunk embeddings, MCompassRAG enriches chunk representations with topic metadata in the same embedding space and trains a lightweight retriever through LLM-teacher distillation. At inference time, MCompassRAG performs topic-aware retrieval without additional LLM calls, improving both efficiency and evidence quality. Across six complex retrieval benchmarks, MCompassRAG improves information efficiency (IE) by 8.24% on average with over 5 times lower latency than the strongest efficient RAG baselines. Code is available on https://github.com/AmirAbaskohi/MCompassRAG.