大規模視覚言語モデルにおける胸部X線推論のための視覚的アトリビューションの再考

要旨

大規模視覚言語モデル（LVLM）は医療応用において有望であるが、回答を視覚的証拠に忠実に基づかせることができないことは、臨床的信頼性に関する深刻な懸念を引き起こす。視覚的属性帰属手法はLVLMの予測を説明するために広く用いられているが、これらの説明が実際にモデルの判断の根拠となる視覚的証拠を反映しているかどうかは、内部モデルの推論に関する正解アノテーションが通常利用不可能であるため、ほとんど検証されていない。本稿では、胸部X線（CXR）推論におけるこの問題に取り組むため、専門家がアノテーションした領域が反事実編集によってモデルの予測に因果的に寄与していることが検証されたCXR-VQAサンプルのみを保持する因果評価フレームワークを開発する。このフレームワークを11種類の属性帰属手法、6つのオープンソースLVLM、および2つの出力モード（直接回答と段階的推論）にわたって適用した結果、既存の属性帰属手法はLVLMが使用する証拠を特定できないことが多いことがわかった。この問題に対処するため、不均衡最適輸送を用いて臨床的に意味のある解剖学的領域を特定し、標的介入を通じてモデル出力に対するそれらの因果効果を測定する概念ベースの属性帰属手法であるMedFocusを提案する。MedFocusは空間的、概念レベル、およびトークンレベルの属性を生成し、従来手法を大幅に上回る性能を示し、医療用LVLMのより信頼性の高い属性帰属への一歩となる。データとコードはhttps://github.com/gzxiong/medfocus/で公開している。

English

Large Vision Language Models (LVLMs) show promise in medical applications, but their inability to faithfully ground responses in visual evidence raises serious concerns about clinical trustworthiness. While visual attribution methods are widely used to explain LVLM predictions, whether these explanations actually reflect the visual evidence underlying the model's decision is largely unverified, since ground-truth annotations for internal model reasoning are typically unavailable. We address this question for chest X-ray (CXR) reasoning by developing a causal evaluation framework that retains only CXR-VQA samples for which the expert-annotated region is verified, via counterfactual editing, to be causally responsible for the model's prediction. Using this framework across 11 attribution methods, six open-source LVLMs, and two output modes (direct answer and step-by-step reasoning), we find that existing attribution methods often fail to identify the evidence used by LVLMs. To address this failure, we propose MedFocus, a concept-based attribution method that localizes clinically meaningful anatomical regions via unbalanced optimal transport and measures their causal effect on model outputs through targeted interventions. MedFocus produces spatial, concept-level, and token-level attributions and substantially outperforms prior methods, taking a step toward more trustworthy attribution for medical LVLMs. Our data and code are available at https://github.com/gzxiong/medfocus/.