ChatPaper.aiChatPaper

步调一致:优化扩散模型中的采样计划

Align Your Steps: Optimizing Sampling Schedules in Diffusion Models

April 22, 2024
作者: Amirmojtaba Sabour, Sanja Fidler, Karsten Kreis
cs.AI

摘要

扩散模型(DMs)已经成为视觉领域及其他领域中最先进的生成建模方法。DMs的一个关键缺点是其较慢的采样速度,依赖于通过大型神经网络进行许多顺序函数评估。从DMs中进行采样可以被视为通过离散化的噪声水平集合解决微分方程。虽然过去的研究主要集中在推导高效求解器上,但对于寻找最佳采样计划却鲜有关注,整个文献都依赖于手工制定的启发式方法。在这项工作中,我们首次提出了一种通用且有原则的方法来优化DMs的采样计划,以获得高质量的输出,称为“调整您的步骤”。我们利用随机微积分方法,并找到了针对不同求解器、训练过的DMs和数据集的最佳计划。我们在几个图像、视频以及2D玩具数据合成基准上评估了我们的新方法,使用了各种不同的采样器,并观察到我们优化的计划在几乎所有实验中均优于先前手工制定的计划。我们的方法展示了采样计划优化的潜力,特别是在少步骤合成方案中。
English
Diffusion models (DMs) have established themselves as the state-of-the-art generative modeling approach in the visual domain and beyond. A crucial drawback of DMs is their slow sampling speed, relying on many sequential function evaluations through large neural networks. Sampling from DMs can be seen as solving a differential equation through a discretized set of noise levels known as the sampling schedule. While past works primarily focused on deriving efficient solvers, little attention has been given to finding optimal sampling schedules, and the entire literature relies on hand-crafted heuristics. In this work, for the first time, we propose a general and principled approach to optimizing the sampling schedules of DMs for high-quality outputs, called Align Your Steps. We leverage methods from stochastic calculus and find optimal schedules specific to different solvers, trained DMs and datasets. We evaluate our novel approach on several image, video as well as 2D toy data synthesis benchmarks, using a variety of different samplers, and observe that our optimized schedules outperform previous hand-crafted schedules in almost all experiments. Our method demonstrates the untapped potential of sampling schedule optimization, especially in the few-step synthesis regime.

Summary

AI-Generated Summary

PDF241December 15, 2024