ギガピクセル病理画像解析のためのマスク付きハードインスタンスマイニングを備えたマルチインスタンス学習フレームワーク

要旨

病理画像をギガピクセル級のWhole Slide Images（WSI）にデジタル化することは、計算病理学（CPath）に新たな道を開きました。陽性組織はギガピクセルWSIのごく一部しか占めないため、既存のMultiple Instance Learning（MIL）手法は通常、注意メカニズムを通じて重要なインスタンスを特定することに焦点を当てています。しかし、これにより分類が容易なインスタンスに偏りが生じ、難しいインスタンスが無視される傾向があります。最近の研究では、識別境界を正確にモデル化するためには難しい例が重要であることが示されています。この考え方をインスタンスレベルで適用し、我々はマスクされた難しいインスタンスマイニング（MHIM-MIL）を用いた新しいMILフレームワークを詳細に設計しました。このフレームワークは、一貫性制約を持つSiamese構造を利用して難しいインスタンスを探索します。MHIM-MILは、クラスを意識したインスタンス確率を使用し、モーメンタム教師を用いて重要なインスタンスをマスクし、暗黙的に難しいインスタンスをマイニングして学生モデルを訓練します。多様で冗長でない難しいインスタンスを得るために、大規模なランダムマスキングを採用し、グローバルリサイクルネットワークを使用してキーフィーチャーが失われるリスクを軽減します。さらに、学生は指数移動平均を使用して教師を更新し、新しい難しいインスタンスを特定して後続の訓練イテレーションに活用し、最適化を安定させます。がん診断、サブタイピング、生存分析タスク、および12のベンチマークでの実験結果は、MHIM-MILが最新の手法を性能と効率の両面で上回ることを示しています。コードは以下で公開されています: https://github.com/DearCaat/MHIM-MIL。

English

Digitizing pathological images into gigapixel Whole Slide Images (WSIs) has opened new avenues for Computational Pathology (CPath). As positive tissue comprises only a small fraction of gigapixel WSIs, existing Multiple Instance Learning (MIL) methods typically focus on identifying salient instances via attention mechanisms. However, this leads to a bias towards easy-to-classify instances while neglecting challenging ones. Recent studies have shown that hard examples are crucial for accurately modeling discriminative boundaries. Applying such an idea at the instance level, we elaborate a novel MIL framework with masked hard instance mining (MHIM-MIL), which utilizes a Siamese structure with a consistency constraint to explore the hard instances. Using a class-aware instance probability, MHIM-MIL employs a momentum teacher to mask salient instances and implicitly mine hard instances for training the student model. To obtain diverse, non-redundant hard instances, we adopt large-scale random masking while utilizing a global recycle network to mitigate the risk of losing key features. Furthermore, the student updates the teacher using an exponential moving average, which identifies new hard instances for subsequent training iterations and stabilizes optimization. Experimental results on cancer diagnosis, subtyping, survival analysis tasks, and 12 benchmarks demonstrate that MHIM-MIL outperforms the latest methods in both performance and efficiency. The code is available at: https://github.com/DearCaat/MHIM-MIL.

ギガピクセル病理画像解析のためのマスク付きハードインスタンスマイニングを備えたマルチインスタンス学習フレームワーク

Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis

要旨

Support