SDXL-Lightning: 점진적 적대적 확산 증류

초록

우리는 SDXL 기반의 1024px 텍스트-이미지 생성에서 1단계/소수 단계로 새로운 최첨단 성능을 달성하는 확산 증류 방법을 제안한다. 본 방법은 점진적 증류와 적대적 증류를 결합하여 품질과 모드 커버리지 간의 균형을 달성한다. 본 논문에서는 이론적 분석, 판별기 설계, 모델 공식화 및 훈련 기법에 대해 논의한다. 우리는 증류된 SDXL-Lightning 모델을 LoRA 및 완전한 UNet 가중치로 오픈소스화하였다.

English

We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.

SDXL-Lightning: 점진적 적대적 확산 증류

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

초록

Support