희소 데이터, 풍부한 결과: 클래스 조건부 이미지 변환을 통한 소샷 준지도 학습

초록

딥러닝은 의료 영상 분야에 혁신을 가져왔지만, 레이블이 지정된 훈련 데이터의 부족으로 인해 그 효과가 심각하게 제한되고 있다. 본 논문은 클래스당 5개에서 50개의 레이블 샘플로 구성된 다양한 환경에서 평가된, 낮은 레이블 데이터 환경을 위해 특별히 설계된 새로운 GAN 기반의 준지도 학습 프레임워크를 소개한다. 우리의 접근 방식은 클래스 조건부 이미지 변환을 위한 생성기, 진위 여부 평가 및 분류를 위한 판별기, 그리고 전용 분류기로 구성된 세 가지 특수 신경망을 3단계 훈련 프레임워크 내에 통합한다. 이 방법은 제한된 레이블 데이터에 대한 지도 학습과 노이즈로부터의 생성이 아닌 이미지 간 변환을 통해 풍부한 비레이블 이미지를 활용하는 비지도 학습을 번갈아 수행한다. 우리는 판별기와 분류기의 신뢰도 가중 예측을 지수 이동 평균을 통해 시간적 일관성과 결합한 앙상블 기반의 가짜 레이블링을 사용하여, 비레이블 데이터에 대한 신뢰할 수 있는 레이블 추정을 가능하게 한다. 11개의 MedMNIST 데이터셋에 대한 포괄적인 평가를 통해, 우리의 접근 방식이 6개의 최신 GAN 기반 준지도 학습 방법보다 통계적으로 유의미한 성능 향상을 달성했으며, 특히 레이블 데이터의 부족이 가장 심각한 극단적인 5샷 환경에서 강력한 성능을 보임을 입증했다. 이 프레임워크는 평가된 모든 환경(클래스당 5, 10, 20, 50샷)에서 우수성을 유지한다. 우리의 접근 방식은 주석 비용이 과도한 의료 영상 응용 분야에서 실질적인 해결책을 제공하며, 최소한의 레이블 데이터로도 견고한 분류 성능을 가능하게 한다. 코드는 https://github.com/GuidoManni/SPARSE에서 확인할 수 있다.

English

Deep learning has revolutionized medical imaging, but its effectiveness is severely limited by insufficient labeled training data. This paper introduces a novel GAN-based semi-supervised learning framework specifically designed for low labeled-data regimes, evaluated across settings with 5 to 50 labeled samples per class. Our approach integrates three specialized neural networks -- a generator for class-conditioned image translation, a discriminator for authenticity assessment and classification, and a dedicated classifier -- within a three-phase training framework. The method alternates between supervised training on limited labeled data and unsupervised learning that leverages abundant unlabeled images through image-to-image translation rather than generation from noise. We employ ensemble-based pseudo-labeling that combines confidence-weighted predictions from the discriminator and classifier with temporal consistency through exponential moving averaging, enabling reliable label estimation for unlabeled data. Comprehensive evaluation across eleven MedMNIST datasets demonstrates that our approach achieves statistically significant improvements over six state-of-the-art GAN-based semi-supervised methods, with particularly strong performance in the extreme 5-shot setting where the scarcity of labeled data is most challenging. The framework maintains its superiority across all evaluated settings (5, 10, 20, and 50 shots per class). Our approach offers a practical solution for medical imaging applications where annotation costs are prohibitive, enabling robust classification performance even with minimal labeled data. Code is available at https://github.com/GuidoManni/SPARSE.

희소 데이터, 풍부한 결과: 클래스 조건부 이미지 변환을 통한 소샷 준지도 학습

SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation

초록

Support