DoTA-RAG: 사고 집적 동적 RAG

초록

본 논문에서는 대규모 웹 지식 인덱스에 최적화된 고처리량 검색 강화 생성 시스템인 DoTA-RAG(Dynamic-of-Thought Aggregation RAG)를 소개한다. 기존 RAG 파이프라인은 대규모 및 다양한 데이터셋에서 높은 지연 시간과 제한된 정확도 문제를 겪는 경우가 많다. DoTA-RAG는 이러한 문제를 해결하기 위해 쿼리 재작성, 특화된 하위 인덱스로의 동적 라우팅, 그리고 다단계 검색 및 순위 지정의 세 단계 파이프라인을 도입한다. 또한, FineWeb-10BT 코퍼스를 재임베딩하여 우수한 임베딩 모델을 평가 및 선정함으로써 검색 성능을 더욱 향상시켰다. 더불어, 다양한 WebOrganizer 주제와 형식을 아우르는 DataMorgana 설정을 통해 생성된 500개의 질문으로 구성된 다양한 Q&A 데이터셋을 구축하였다. DoTA-RAG는 LiveRAG 사전 구축 벡터 저장소를 사용한 기준점 0.752에서 1.478로 답변 정확도 점수를 향상시키면서도 낮은 지연 시간을 유지하였으며, Live Challenge Day에서 0.929의 정확도 점수를 달성하였다. 이러한 결과는 DoTA-RAG가 빠르고 신뢰할 수 있는 대규모 및 진화하는 지식 소스 접근이 필요한 분야에서의 실용적 배포 가능성을 강조한다.

English

In this paper, we introduce DoTA-RAG (Dynamic-of-Thought Aggregation RAG), a retrieval-augmented generation system optimized for high-throughput, large-scale web knowledge indexes. Traditional RAG pipelines often suffer from high latency and limited accuracy over massive, diverse datasets. DoTA-RAG addresses these challenges with a three-stage pipeline: query rewriting, dynamic routing to specialized sub-indexes, and multi-stage retrieval and ranking. We further enhance retrieval by evaluating and selecting a superior embedding model, re-embedding the large FineWeb-10BT corpus. Moreover, we create a diverse Q&A dataset of 500 questions generated via the DataMorgana setup across a broad range of WebOrganizer topics and formats. DoTA-RAG improves the answer correctness score from 0.752 (baseline, using LiveRAG pre-built vector store) to 1.478 while maintaining low latency, and it achieves a 0.929 correctness score on the Live Challenge Day. These results highlight DoTA-RAG's potential for practical deployment in domains requiring fast, reliable access to large and evolving knowledge sources.

DoTA-RAG: 사고 집적 동적 RAG

DoTA-RAG: Dynamic of Thought Aggregation RAG

초록

Support