RadAgent : Un agent IA utilisant des outils pour l'interprétation pas à pas de la tomodensitométrie thoracique

Résumé

Les modèles vision-langage (VLM) ont considérablement fait progresser l'interprétation et la génération de comptes-rendus en imagerie médicale complexe, telle que la tomodensitométrie (TDM), grâce à l'intelligence artificielle. Cependant, les méthodes existantes cantonnent largement les cliniciens au rôle d'observateurs passifs des résultats finaux, sans leur fournir de trace de raisonnement interprétable à inspecter, valider ou affiner. Pour remédier à cela, nous présentons RadAgent, un agent d'IA utilisant des outils qui génère des comptes-rendus TDM via un processus étape par étape et interprétable. Chaque rapport produit est accompagné d'une trace entièrement inspectable des décisions intermédiaires et des interactions avec les outils, permettant aux cliniciens d'examiner comment les conclusions rapportées sont dérivées. Nos expériences montrent que RadAgent améliore la génération de comptes-rendus de TDM thoracique par rapport à son homologue VLM 3D, CT-Chat, sur trois dimensions. La précision clinique s'améliore de 6,0 points (36,4 % en relatif) en macro-F1 et de 5,4 points (19,6 % en relatif) en micro-F1. La robustesse dans des conditions adverses s'améliore de 24,7 points (41,9 % en relatif). De plus, RadAgent atteint 37,0 % en fidélité (faithfulness), une capacité entièrement absente chez son homologue VLM 3D. En structurant l'interprétation des TDM thoraciques comme une trace de raisonnement explicite, augmentée par des outils et itérative, RadAgent nous rapproche d'une IA transparente et fiable pour la radiologie.

English

Vision-language models (VLM) have markedly advanced AI-driven interpretation and reporting of complex medical imaging, such as computed tomography (CT). Yet, existing methods largely relegate clinicians to passive observers of final outputs, offering no interpretable reasoning trace for them to inspect, validate, or refine. To address this, we introduce RadAgent, a tool-using AI agent that generates CT reports through a stepwise and interpretable process. Each resulting report is accompanied by a fully inspectable trace of intermediate decisions and tool interactions, allowing clinicians to examine how the reported findings are derived. In our experiments, we observe that RadAgent improves Chest CT report generation over its 3D VLM counterpart, CT-Chat, across three dimensions. Clinical accuracy improves by 6.0 points (36.4% relative) in macro-F1 and 5.4 points (19.6% relative) in micro-F1. Robustness under adversarial conditions improves by 24.7 points (41.9% relative). Furthermore, RadAgent achieves 37.0% in faithfulness, a new capability entirely absent in its 3D VLM counterpart. By structuring the interpretation of chest CT as an explicit, tool-augmented and iterative reasoning trace, RadAgent brings us closer toward transparent and reliable AI for radiology.

RadAgent : Un agent IA utilisant des outils pour l'interprétation pas à pas de la tomodensitométrie thoracique

RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography

Résumé

Support