Med42-v2: A Suite of Clinical LLMs

Abstract

Med42-v2 introduces a suite of clinical large language models (LLMs) designed to address the limitations of generic models in healthcare settings. These models are built on the Llama3 architecture and fine-tuned using specialized clinical data. They underwent multi-stage preference alignment so that they respond effectively to natural prompts. While generic models are often preference-aligned to avoid answering clinical queries as a precaution, Med42-v2 is specifically trained to overcome this limitation, enabling its use in clinical settings. Across various medical benchmarks, Med42-v2 models outperform both the original Llama3 models, in the 8B and 70B parameter configurations, and GPT-4. These LLMs are developed to understand clinical queries, perform reasoning tasks, and provide valuable assistance in clinical environments. The models are now publicly available at https://huggingface.co/m42-health.