확률의 속박을 깨다: 대규모 언어 모델에서 인식론적 불확실성을 위한 새로운 프레임워크로서의 중성 논리

초록

대규모 언어 모델(LLM)은 주로 결과 확률의 합이 1로 제약되는 확률적 프레임워크에 의해 지배된다. 소프트맥스 층에 의해 종종 부과되는 이러한 구조적 한계는 불확실성의 붕괴를 초래하여 인식론적 불확실성, 역설, 모호성을 구분하기 어렵게 만든다. 본 연구는 진리(T), 불확정성(I), 거짓(F)을 세 가지 독립적 차원으로 다루는 중성 논리(Neutrosophic Logic) 프레임워크를 LLM의 인식 상태 모델링에 적용한 경험적 조사를 제시한다. 네 가지 OpenAI GPT 모델군을 대상으로 다섯 가지 언어 현상(논리적 역설, 인식론적 무지, 모호성, 윤리적 모순, 미래 우발성)에 대해 세 가지 프롬프팅 전략(중성적, 확률적, 엔트로피 기반)으로 실험을 수행하였다. 연구 결과, T+I+F > 1(본 연구에서 초진리(hyper-truth)라 명명한 상태)을 허용하는 중성적 접근법이 모델 내부 상태에 대한 더 풍부한 표현을 제공함을 확인하였다. 평가의 35%에서 초진리가 자발적으로 나타났으며, 주로 윤리적 모순과 논리적 역설 하에서 발생하였다. 이 접근법이 퍼지 맥락에서 진릿값을 보존하고 내부 모델 충돌을 식별 및 정량화하는 강건한 방법을 제공함을 입증한다. 결론적으로, 중성적 평가 층의 통합은 보다 투명하고 신뢰할 수 있으며 윤리적으로 민감한 AI 시스템을 향한 중요한 단계임을 주장한다.

English

Large Language Models (LLMs) are predominantly governed by probabilistic frameworks in which the sum of outcome probabilities is constrained to unity. This architectural limitation, often imposed by Softmax layers, leads to a collapse of uncertainty that makes it difficult to differentiate between epistemic uncertainty, paradox, and vagueness. We present an empirical investigation of the application of Neutrosophic Logic, a framework that treats Truth (T), Indeterminacy (I), and Falsity (F) as three independent dimensions, to model epistemic states in LLMs. We conducted experiments on a family of four OpenAI GPT models across five linguistic phenomena: logical paradoxes, epistemic ignorance, vagueness, ethical contradictions, and future contingencies, under three prompting strategies: neutrosophic, probabilistic, and entropy-derived. Our findings reveal that the neutrosophic approach, by allowing T+I+F > 1, a state we term hyper-truth, provides a richer representation of a model's internal state. In 35% of evaluations, hyper-truth emerged spontaneously, predominantly under ethical contradiction and logical paradox. We demonstrate that this approach preserves truth values in fuzzy contexts and offers a robust method for identifying and quantifying internal model conflict. We conclude that the integration of neutrosophic evaluation layers is a critical step toward more transparent, reliable, and ethically aware AI systems.