StructRAG: 추론 시 하이브리드 정보 구조화를 통해 LLMs의 지식 집약적 추론 강화

초록

검색 증강 생성 (RAG)은 많은 지식 기반 작업에서 대형 언어 모델 (LLM)을 효과적으로 향상시키는 주요 수단입니다. 그러나 기존의 RAG 방법은 지식 집약적 추론 작업에서 어려움을 겪습니다. 왜냐하면 이러한 작업에 필요한 유용한 정보가 나쁘게 흩어져 있기 때문입니다. 이 특성으로 인해 기존의 RAG 방법은 주요 정보를 정확하게 식별하고 이러한 노이즈가 있는 증강과 함께 전역 추론을 수행하는 데 어려움을 겪습니다. 본 논문에서는 지식 집약적 추론을 다룰 때 사람들이 원시 정보를 다양한 구조화된 지식으로 변환하는 인지 이론에 영감을 받아 작업에 최적의 구조 유형을 식별하고 이 구조화된 형식으로 원본 문서를 재구성하며 결과적인 구조를 기반으로 답변을 추론할 수 있는 새로운 프레임워크인 StructRAG을 제안합니다. 다양한 지식 집약적 작업을 통한 광범위한 실험 결과 StructRAG이 최첨단 성능을 달성하며 특히 어려운 시나리오에서 뛰어난 성과를 보여 복잡한 실제 응용 프로그램에서 LLM을 향상시키는 효과적인 솔루션으로의 잠재력을 입증합니다.

English

Retrieval-augmented generation (RAG) is a key means to effectively enhance large language models (LLMs) in many knowledge-based tasks. However, existing RAG methods struggle with knowledge-intensive reasoning tasks, because useful information required to these tasks are badly scattered. This characteristic makes it difficult for existing RAG methods to accurately identify key information and perform global reasoning with such noisy augmentation. In this paper, motivated by the cognitive theories that humans convert raw information into various structured knowledge when tackling knowledge-intensive reasoning, we proposes a new framework, StructRAG, which can identify the optimal structure type for the task at hand, reconstruct original documents into this structured format, and infer answers based on the resulting structure. Extensive experiments across various knowledge-intensive tasks show that StructRAG achieves state-of-the-art performance, particularly excelling in challenging scenarios, demonstrating its potential as an effective solution for enhancing LLMs in complex real-world applications.

StructRAG: 추론 시 하이브리드 정보 구조화를 통해 LLMs의 지식 집약적 추론 강화

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

초록

Support