スパースデータ、豊かな結果：クラス条件付き画像変換によるFew-Shot半教師あり学習

要旨

深層学習は医療画像分野に革命をもたらしたが、その有効性はラベル付き訓練データの不足によって大きく制限されている。本論文では、クラスあたり5～50のラベル付きサンプルという低ラベルデータ環境に特化した、新規のGANベースの半教師あり学習フレームワークを提案する。本手法は、クラス条件付き画像変換のためのジェネレータ、真正性評価と分類のためのディスクリミネータ、専用の分類器という3つの特殊化されたニューラルネットワークを、3段階の訓練フレームワーク内に統合する。この方法は、限られたラベル付きデータを用いた教師あり学習と、ノイズからの生成ではなく画像間変換を通じて豊富なラベルなし画像を活用する教師なし学習を交互に行う。我々は、ディスクリミネータと分類器からの信頼度重み付き予測を指数移動平均による時間的一貫性と組み合わせたアンサンブルベースの擬似ラベル付けを採用し、ラベルなしデータに対する信頼性の高いラベル推定を可能にする。11のMedMNISTデータセットを用いた包括的評価により、本手法が6つの最先端のGANベース半教師あり手法を統計的に有意に上回り、特にラベル付きデータの不足が最も深刻な5ショット設定において強力な性能を発揮することが示された。本フレームワークは、評価されたすべての設定（クラスあたり5、10、20、50ショット）において優位性を維持する。本手法は、アノテーションコストが過大な医療画像アプリケーションにおいて、最小限のラベル付きデータでも堅牢な分類性能を実現する実用的なソリューションを提供する。コードはhttps://github.com/GuidoManni/SPARSEで公開されている。

English

Deep learning has revolutionized medical imaging, but its effectiveness is severely limited by insufficient labeled training data. This paper introduces a novel GAN-based semi-supervised learning framework specifically designed for low labeled-data regimes, evaluated across settings with 5 to 50 labeled samples per class. Our approach integrates three specialized neural networks -- a generator for class-conditioned image translation, a discriminator for authenticity assessment and classification, and a dedicated classifier -- within a three-phase training framework. The method alternates between supervised training on limited labeled data and unsupervised learning that leverages abundant unlabeled images through image-to-image translation rather than generation from noise. We employ ensemble-based pseudo-labeling that combines confidence-weighted predictions from the discriminator and classifier with temporal consistency through exponential moving averaging, enabling reliable label estimation for unlabeled data. Comprehensive evaluation across eleven MedMNIST datasets demonstrates that our approach achieves statistically significant improvements over six state-of-the-art GAN-based semi-supervised methods, with particularly strong performance in the extreme 5-shot setting where the scarcity of labeled data is most challenging. The framework maintains its superiority across all evaluated settings (5, 10, 20, and 50 shots per class). Our approach offers a practical solution for medical imaging applications where annotation costs are prohibitive, enabling robust classification performance even with minimal labeled data. Code is available at https://github.com/GuidoManni/SPARSE.

スパースデータ、豊かな結果：クラス条件付き画像変換によるFew-Shot半教師あり学習

SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation

要旨

Support