체인-오브-검증(Chain-of-Verification)은 대규모 언어 모델의 환각 현상을 줄여줍니다.

초록

그럴듯하지만 사실과 다른 정보를 생성하는 현상, 즉 환각(hallucination)은 대규모 언어 모델에서 아직 해결되지 않은 문제입니다. 우리는 언어 모델이 자신이 제공한 응답에 대해 숙고하여 실수를 수정할 수 있는 능력을 연구했습니다. 이를 위해 우리는 체인-오브-검증(Chain-of-Verification, CoVe) 방법을 개발했습니다. 이 방법은 모델이 먼저 (i) 초기 응답을 초안으로 작성한 다음, (ii) 이 초안을 사실 확인하기 위한 검증 질문을 계획하고, (iii) 다른 응답에 영향을 받지 않도록 독립적으로 이 질문에 답하며, (iv) 최종적으로 검증된 응답을 생성하는 과정으로 구성됩니다. 실험을 통해 CoVe가 Wikidata의 목록 기반 질문, 폐쇄형 MultiSpanQA, 그리고 장문 텍스트 생성 등 다양한 작업에서 환각 현상을 줄이는 것을 확인했습니다.

English

Generation of plausible yet incorrect factual information, termed hallucination, is an unsolved issue in large language models. We study the ability of language models to deliberate on the responses they give in order to correct their mistakes. We develop the Chain-of-Verification (CoVe) method whereby the model first (i) drafts an initial response; then (ii) plans verification questions to fact-check its draft; (iii) answers those questions independently so the answers are not biased by other responses; and (iv) generates its final verified response. In experiments, we show CoVe decreases hallucinations across a variety of tasks, from list-based questions from Wikidata, closed book MultiSpanQA and longform text generation.

체인-오브-검증(Chain-of-Verification)은 대규모 언어 모델의 환각 현상을 줄여줍니다.

Chain-of-Verification Reduces Hallucination in Large Language Models

초록

Support