동적 저신뢰도 마스킹을 통한 적응형 클래스프리 가이던스

초록

Classifier-Free Guidance(CFG)는 조건부 및 무조건부 예측을 보간함으로써 생성 모델의 제어 가능성을 크게 향상시킵니다. 그러나 표준 CFG는 종종 정적인 무조건부 입력을 사용하는데, 이는 모델의 불확실성이 동적으로 변하는 반복적 생성 과정에서는 최적이 아닐 수 있습니다. 우리는 Adaptive Classifier-Free Guidance(A-CFG)라는 새로운 방법을 소개합니다. 이 방법은 모델의 순간적 예측 신뢰도를 활용하여 무조건부 입력을 맞춤화합니다. 반복적(마스킹된) 확산 언어 모델의 각 단계에서, A-CFG는 현재 생성된 시퀀스에서 모델이 낮은 신뢰도를 보이는 토큰을 식별합니다. 이러한 토큰은 일시적으로 다시 마스킹되어 동적이고 지역화된 무조건부 입력을 생성합니다. 이는 CFG의 수정적 영향을 모호성이 있는 영역에 정확하게 집중시켜 더 효과적인 가이던스를 이끌어냅니다. 우리는 A-CFG를 최첨단 마스킹 확산 언어 모델에 통합하고 그 효용성을 입증합니다. 다양한 언어 생성 벤치마크에서의 실험은 A-CFG가 표준 CFG에 비해 상당한 개선을 가져오며, 예를 들어 GPQA에서 3.9점의 향상을 달성함을 보여줍니다. 우리의 연구는 반복적 생성 과정에서 모델의 불확실성에 따라 가이던스 메커니즘을 동적으로 적응시키는 이점을 강조합니다.

English

Classifier-Free Guidance (CFG) significantly enhances controllability in generative models by interpolating conditional and unconditional predictions. However, standard CFG often employs a static unconditional input, which can be suboptimal for iterative generation processes where model uncertainty varies dynamically. We introduce Adaptive Classifier-Free Guidance (A-CFG), a novel method that tailors the unconditional input by leveraging the model's instantaneous predictive confidence. At each step of an iterative (masked) diffusion language model, A-CFG identifies tokens in the currently generated sequence for which the model exhibits low confidence. These tokens are temporarily re-masked to create a dynamic, localized unconditional input. This focuses CFG's corrective influence precisely on areas of ambiguity, leading to more effective guidance. We integrate A-CFG into a state-of-the-art masked diffusion language model and demonstrate its efficacy. Experiments on diverse language generation benchmarks show that A-CFG yields substantial improvements over standard CFG, achieving, for instance, a 3.9 point gain on GPQA. Our work highlights the benefit of dynamically adapting guidance mechanisms to model uncertainty in iterative generation.

동적 저신뢰도 마스킹을 통한 적응형 클래스프리 가이던스

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

초록

Support