CORG: 복잡하고 상호 연관된 맥락에서 답변 생성하기

초록

실제 말뭉치에서는 지식이 문서 간에 빈번히 반복되지만, 모호한 명명, 구식 정보 또는 오류로 인해 불일치가 발생하여 문맥 간 복잡한 상호관계가 형성됩니다. 선행 연구에 따르면, 언어 모델은 이러한 복잡성을 다루는 데 어려움을 겪으며, 일반적으로 단일 요소에만 집중하는 경향이 있습니다. 우리는 이러한 관계를 네 가지 유형으로 분류합니다: 산만한 관계, 모호한 관계, 반사실적 관계, 그리고 중복된 관계. 우리의 분석 결과, 이러한 상호관계를 동시에 효과적으로 해결할 수 있는 단일 접근법은 없음이 밝혀졌습니다. 따라서 우리는 다중 문맥을 독립적으로 처리되는 그룹으로 조직하는 프레임워크인 Context Organizer(CORG)를 소개합니다. 이 설계는 모델이 관련된 모든 답변을 효율적으로 찾으면서도 모호성을 해소할 수 있도록 합니다. CORG는 그래프 생성기, 재순위 지정기, 그리고 집계기의 세 가지 주요 구성 요소로 이루어져 있습니다. 우리의 실험 결과, CORG는 성능과 효율성을 효과적으로 균형 잡으며, 기존의 그룹화 방법을 능가하고 더 많은 계산 자원을 요구하는 단일 문맥 접근법과 비슷한 결과를 달성함을 보여줍니다.

English

In a real-world corpus, knowledge frequently recurs across documents but often contains inconsistencies due to ambiguous naming, outdated information, or errors, leading to complex interrelationships between contexts. Previous research has shown that language models struggle with these complexities, typically focusing on single factors in isolation. We classify these relationships into four types: distracting, ambiguous, counterfactual, and duplicated. Our analysis reveals that no single approach effectively addresses all these interrelationships simultaneously. Therefore, we introduce Context Organizer (CORG), a framework that organizes multiple contexts into independently processed groups. This design allows the model to efficiently find all relevant answers while ensuring disambiguation. CORG consists of three key components: a graph constructor, a reranker, and an aggregator. Our results demonstrate that CORG balances performance and efficiency effectively, outperforming existing grouping methods and achieving comparable results to more computationally intensive, single-context approaches.

CORG: 복잡하고 상호 연관된 맥락에서 답변 생성하기

CORG: Generating Answers from Complex, Interrelated Contexts

초록

Support