MedGemma 1.5 技術報告書

要旨

本稿では、MedGemmaコレクションの最新モデルであるMedGemma 1.5 4Bを紹介します。MedGemma 1.5は、MedGemma 1を拡張し、以下の追加機能を統合しました：高次元医療画像（CT/MRIボリュームおよび組織病理学的全スライド画像）、バウンディングボックスによる解剖学的位置同定、経時的な胸部X線分析、および改良された医療文書理解（検査報告書、電子健康記録）。単一アーキテクチャ内でこれらのモダリティを実現するために必要な革新、すなわち新規トレーニングデータ、長文脈3Dボリュームスライシング、全スライド病理サンプリングについて詳細に説明します。MedGemma 1 4Bと比較して、MedGemma 1.5 4Bはこれらの新領域で大幅な性能向上を示し、3D MRI状態分類精度を11%、3D CT状態分類を3%向上させました（絶対改善値）。全スライド病理画像では、47%のマクロF1スコア向上を達成しました。さらに、胸部X線におけるIntersection over Unionが35%増加し解剖学的位置同定が改善され、経時的（多時点）胸部X線分析では4%のマクロ精度を達成しました。MedGemma 1に対するマルチモーダル性能の改善に加えて、MedGemma 1.5はテキストベースの臨床知識と推論も改善し、MedQA精度で5%、EHRQA精度で22%向上しました。また、4つの異なる検査報告書情報抽出データセット（EHR Datasets 2, 3, 4, および Mendeley Clinical Laboratory Test Reports）において平均18%のマクロF1スコアを達成しました。総合すると、MedGemma 1.5は、開発者が次世代医療AIシステムを構築するための改良された基盤として設計された、コミュニティのための堅牢なオープンリソースとして機能します。MedGemma 1.5を基に構築するためのリソースとチュートリアルは、https://goo.gle/MedGemma でご覧いただけます。

English

We introduce MedGemma 1.5 4B, the latest model in the MedGemma collection. MedGemma 1.5 expands on MedGemma 1 by integrating additional capabilities: high-dimensional medical imaging (CT/MRI volumes and histopathology whole slide images), anatomical localization via bounding boxes, multi-timepoint chest X-ray analysis, and improved medical document understanding (lab reports, electronic health records). We detail the innovations required to enable these modalities within a single architecture, including new training data, long-context 3D volume slicing, and whole-slide pathology sampling. Compared to MedGemma 1 4B, MedGemma 1.5 4B demonstrates significant gains in these new areas, improving 3D MRI condition classification accuracy by 11% and 3D CT condition classification by 3% (absolute improvements). In whole slide pathology imaging, MedGemma 1.5 4B achieves a 47% macro F1 gain. Additionally, it improves anatomical localization with a 35% increase in Intersection over Union on chest X-rays and achieves a 4% macro accuracy for longitudinal (multi-timepoint) chest x-ray analysis. Beyond its improved multimodal performance over MedGemma 1, MedGemma 1.5 improves on text-based clinical knowledge and reasoning, improving by 5% on MedQA accuracy and 22% on EHRQA accuracy. It also achieves an average of 18% macro F1 on 4 different lab report information extraction datasets (EHR Datasets 2, 3, 4, and Mendeley Clinical Laboratory Test Reports). Taken together, MedGemma 1.5 serves as a robust, open resource for the community, designed as an improved foundation on which developers can create the next generation of medical AI systems. Resources and tutorials for building upon MedGemma 1.5 can be found at https://goo.gle/MedGemma.

MedGemma 1.5 技術報告書

MedGemma 1.5 Technical Report

要旨

Support