Shai:一個用於資產管理的大型語言模型
Shai: A large language model for asset management
December 21, 2023
作者: Zhongyang Guo, Guanran Jiang, Zhongdan Zhang, Peng Li, Zhefeng Wang, Yinchun Wang
cs.AI
摘要
本文介紹了「Shai」,一個針對資產管理行業設計的 10B 級大型語言模型,建立在開源基礎模型之上。通過持續的預訓練和微調,使用針對性語料庫,Shai 在與其領域相關的任務中展現出卓越的表現,勝過基準模型。我們的研究包括開發創新的評估框架,該框架整合了專業資格考試、定制任務、開放式問答和安全評估,全面評估了Shai的能力。此外,我們討論了利用大型語言模型如GPT-4進行資產管理性能評估所面臨的挑戰和影響,建議結合自動評估和人類判斷。Shai的發展展示了10B級大型語言模型在金融領域中的潛力和多功能性,具有顯著的性能和適度的計算需求,希望提供實用見解和方法,協助同行在類似努力中取得成功。
English
This paper introduces "Shai" a 10B level large language model specifically
designed for the asset management industry, built upon an open-source
foundational model. With continuous pre-training and fine-tuning using a
targeted corpus, Shai demonstrates enhanced performance in tasks relevant to
its domain, outperforming baseline models. Our research includes the
development of an innovative evaluation framework, which integrates
professional qualification exams, tailored tasks, open-ended question
answering, and safety assessments, to comprehensively assess Shai's
capabilities. Furthermore, we discuss the challenges and implications of
utilizing large language models like GPT-4 for performance assessment in asset
management, suggesting a combination of automated evaluation and human
judgment. Shai's development, showcasing the potential and versatility of
10B-level large language models in the financial sector with significant
performance and modest computational requirements, hopes to provide practical
insights and methodologies to assist industry peers in their similar endeavors.