MOOZY: 컴퓨테이션 병리학을 위한 환자 중심 파운데이션 모델

초록

계산병리학은 다양한 임상 과제에 걸쳐 전이 가능한 전체 슬라이드 이미지(WSI) 기초 모델이 필요하지만, 현재의 접근법은 여전히 대부분 슬라이드 중심이며, 비공개 데이터와 고비용의 대응 보고서 감독에 의존하는 경우가 많고, 동일 환자로부터 얻은 여러 슬라이드 간의 관계를 명시적으로 모델링하지 않습니다. 본 연구에서는 개별 슬라이드가 아닌 환자 사례를 표현의 핵심 단위로 하는 환자 중심 병리 기초 모델인 MOOZY를 제시합니다. MOOZY는 프리트레이닝 과정에서 케이스 트랜스포머를 통해 동일 환자의 모든 슬라이드 간 의존성을 명시적으로 모델링하며, 다단계 공개 자기 지도 학습과 확장된 저비용 과제 감독을 결합합니다. 1단계에서는 77,134개의 공개 슬라이드 특징 그리드에 대해 마스킹 자기 증류 방식을 적용하여 비전 전용 슬라이드 인코더를 프리트레이닝합니다. 2단계에서는 케이스 트랜스포머와 56개 공개 데이터셋으로부터 추출한 333개 과제(205개 분류 과제 및 4개 종단점에 대한 128개 생존 분석 과제 포함)에 대한 다중 과제 감독을 통해 이러한 표현을 임상 의미론과 정렬합니다. 8개의 별도 평가 과제에 대해 5-겹 고정 특징 탐색 평가를 수행한 결과, MOOZY는 대부분의 지표에서 최고 또는 공동 최고 성능을 달성했으며, TITAN 대비 가중치 F1, 가중치 ROC-AUC, 균형 정확도의 매크로 평균을 각각 +7.37%, +5.50%, +7.83% 향상시켰고, PRISM 대비 각각 +8.83%, +10.70%, +9.78% 향상시켰습니다. MOOZY는 8,577만 개의 매개변수로 매개변수 효율적이며, GigaPath보다 14배 작습니다. 이러한 결과는 공개적이고 재현 가능한 환자 수준의 프리트레이닝이 전이 가능한 임베딩을 생성함으로써, 확장 가능한 환자 중심 조직병리학 기초 모델을 구축하는 실용적인 경로를 제공함을 입증합니다.

English

Computational pathology needs whole-slide image (WSI) foundation models that transfer across diverse clinical tasks, yet current approaches remain largely slide-centric, often depend on private data and expensive paired-report supervision, and do not explicitly model relationships among multiple slides from the same patient. We present MOOZY, a patient-first pathology foundation model in which the patient case, not the individual slide, is the core unit of representation. MOOZY explicitly models dependencies across all slides from the same patient via a case transformer during pretraining, combining multi-stage open self-supervision with scaled low-cost task supervision. In Stage 1, we pretrain a vision-only slide encoder on 77,134 public slide feature grids using masked self-distillation. In Stage 2, we align these representations with clinical semantics using a case transformer and multi-task supervision over 333 tasks from 56 public datasets, including 205 classification and 128 survival tasks across four endpoints. Across eight held-out tasks with five-fold frozen-feature probe evaluation, MOOZY achieves best or tied-best performance on most metrics and improves macro averages over TITAN by +7.37%, +5.50%, and +7.83% and over PRISM by +8.83%, +10.70%, and +9.78% for weighted F1, weighted ROC-AUC, and balanced accuracy, respectively. MOOZY is also parameter efficient with 85.77M parameters, 14x smaller than GigaPath. These results demonstrate that open, reproducible patient-level pretraining yields transferable embeddings, providing a practical path toward scalable patient-first histopathology foundation models.

MOOZY: 컴퓨테이션 병리학을 위한 환자 중심 파운데이션 모델

MOOZY: A Patient-First Foundation Model for Computational Pathology

초록

Support