라이선싱 오라클을 활용한 언어 모델의 환각 현상 억제

초록

언어 모델은 놀라운 자연어 생성 능력을 보이지만, 구문적으로 일관된 응답을 생성함에도 불구하고 사실적으로 부정확한 정보를 생성하는 환각 현상에 취약한 한계를 지닙니다. 본 연구에서는 구조화된 지식 그래프에 대한 형식적 검증을 통해 사실적 제약을 적용하여 언어 모델의 환각 현상을 차단하도록 설계된 아키텍처 솔루션인 라이선싱 오라클을 소개합니다. 데이터 확장이나 미세 조정에 의존하는 통계적 접근법과 달리, 라이선싱 오라클은 모델의 생성 과정에 결정론적 검증 단계를 내장하여 사실적으로 정확한 주장만이 출력되도록 보장합니다. 우리는 라이선싱 오라클의 효과를 평가하기 위해 기준 언어 모델 생성, 사실 회상 미세 조정, 회피 행동 미세 조정, 검색 증강 생성(RAG) 등 여러 최신 방법과 비교 실험을 수행했습니다. 그 결과, RAG와 미세 조정이 성능을 향상시키기는 하지만 환각 현상을 근절하지는 못하는 반면, 라이선싱 오라클은 완벽한 회피 정밀도(AP = 1.0)와 0%의 오답률(FAR-NE = 0.0)을 달성하여 사실 응답에서 89.1%의 정확도로 유효한 주장만을 생성함을 입증했습니다. 이 연구는 라이선싱 오라클과 같은 아키텍처 혁신이 구조화된 지식 표현이 가능한 영역에서 환각 현상에 대한 필요 충분 조건을 제공하며, 통계적 방법이 따라올 수 없는 보장을 제공함을 보여줍니다. 비록 라이선싱 오라클이 사실 기반 영역의 환각 현상을 특수하게 해결하도록 설계되었지만, 그 프레임워크는 향후 AI 시스템에서 진실 제약 생성의 기초를 마련함으로써 신뢰할 수 있고 인식론적으로 근거 있는 모델로 나아가는 새로운 길을 제시합니다.

English

Language models exhibit remarkable natural language generation capabilities but remain prone to hallucinations, generating factually incorrect information despite producing syntactically coherent responses. This study introduces the Licensing Oracle, an architectural solution designed to stem hallucinations in LMs by enforcing truth constraints through formal validation against structured knowledge graphs. Unlike statistical approaches that rely on data scaling or fine-tuning, the Licensing Oracle embeds a deterministic validation step into the model's generative process, ensuring that only factually accurate claims are made. We evaluated the effectiveness of the Licensing Oracle through experiments comparing it with several state-of-the-art methods, including baseline language model generation, fine-tuning for factual recall, fine-tuning for abstention behavior, and retrieval-augmented generation (RAG). Our results demonstrate that although RAG and fine-tuning improve performance, they fail to eliminate hallucinations. In contrast, the Licensing Oracle achieved perfect abstention precision (AP = 1.0) and zero false answers (FAR-NE = 0.0), ensuring that only valid claims were generated with 89.1% accuracy in factual responses. This work shows that architectural innovations, such as the Licensing Oracle, offer a necessary and sufficient solution for hallucinations in domains with structured knowledge representations, offering guarantees that statistical methods cannot match. Although the Licensing Oracle is specifically designed to address hallucinations in fact-based domains, its framework lays the groundwork for truth-constrained generation in future AI systems, providing a new path toward reliable, epistemically grounded models.

라이선싱 오라클을 활용한 언어 모델의 환각 현상 억제

Stemming Hallucination in Language Models Using a Licensing Oracle

초록

Support