Med42-v2: 임상 LLMs 모음

초록

Med42-v2는 일반적인 모델의 한계를 해결하기 위해 설계된 임상 대규모 언어 모델(LLM) 스위트를 소개합니다. 이러한 모델들은 Llama3 아키텍처 위에 구축되었으며 전문적인 임상 데이터를 사용하여 세밀하게 조정되었습니다. 자연어 프롬프트에 효과적으로 응답하기 위해 다단계 선호 정렬을 거쳤습니다. 일반적인 모델들은 종종 임상 질의에 대답을 피하기 위해 선호 정렬을 하지만, Med42-v2는 이 제한을 극복하기 위해 특별히 훈련되어 임상 환경에서 사용할 수 있도록 되었습니다. Med42-v2 모델은 8B 및 70B 매개변수 구성 및 GPT-4에서 다양한 의학적 벤치마크에서 원래의 Llama3 모델보다 우수한 성능을 보여주었습니다. 이러한 LLM은 임상 질의를 이해하고 추론 작업을 수행하며 임상 환경에서 가치 있는 지원을 제공하기 위해 개발되었습니다. 이러한 모델은 이제 https://huggingface.co/m42-health{https://huggingface.co/m42-health}에서 공개적으로 이용 가능합니다.

English

Med42-v2 introduces a suite of clinical large language models (LLMs) designed to address the limitations of generic models in healthcare settings. These models are built on Llama3 architecture and fine-tuned using specialized clinical data. They underwent multi-stage preference alignment to effectively respond to natural prompts. While generic models are often preference-aligned to avoid answering clinical queries as a precaution, Med42-v2 is specifically trained to overcome this limitation, enabling its use in clinical settings. Med42-v2 models demonstrate superior performance compared to the original Llama3 models in both 8B and 70B parameter configurations and GPT-4 across various medical benchmarks. These LLMs are developed to understand clinical queries, perform reasoning tasks, and provide valuable assistance in clinical environments. The models are now publicly available at https://huggingface.co/m42-health{https://huggingface.co/m42-health}.