量子ノイズ除去拡散モデル

要旨

近年、DALL-E、Craiyon、Stable Diffusionなどの機械学習モデルは、簡潔な記述から高解像度の画像を生成する能力により、大きな注目を集めています。同時に、量子コンピューティング、特に量子機械学習は、量子力学を活用して従来の機械学習アルゴリズムの増大する計算要件に対応する有望な進展を示しています。本論文では、量子機械学習と変分量子回路の統合を探求し、拡散ベースの画像生成モデルの効率を向上させることを目的としています。具体的には、古典的な拡散モデルが抱える2つの課題、すなわちサンプリング速度の低さと膨大なパラメータ要件に取り組みます。我々は2つの量子拡散モデルを導入し、MNIST数字、Fashion MNIST、CIFAR-10を用いて古典的なモデルと比較評価を行いました。我々のモデルは、同程度のパラメータ数を持つ古典モデルを性能指標FID、SSIM、PSNRにおいて上回りました。さらに、拡散プロセスを単一ステップに統合し、高速なワンステップ画像生成を可能にする一貫性モデルユニタリシングルサンプリングアーキテクチャを提案します。

English

In recent years, machine learning models like DALL-E, Craiyon, and Stable Diffusion have gained significant attention for their ability to generate high-resolution images from concise descriptions. Concurrently, quantum computing is showing promising advances, especially with quantum machine learning which capitalizes on quantum mechanics to meet the increasing computational requirements of traditional machine learning algorithms. This paper explores the integration of quantum machine learning and variational quantum circuits to augment the efficacy of diffusion-based image generation models. Specifically, we address two challenges of classical diffusion models: their low sampling speed and the extensive parameter requirements. We introduce two quantum diffusion models and benchmark their capabilities against their classical counterparts using MNIST digits, Fashion MNIST, and CIFAR-10. Our models surpass the classical models with similar parameter counts in terms of performance metrics FID, SSIM, and PSNR. Moreover, we introduce a consistency model unitary single sampling architecture that combines the diffusion procedure into a single step, enabling a fast one-step image generation.

量子ノイズ除去拡散モデル

Quantum Denoising Diffusion Models

要旨

Support