RakutenAI-7B:为日语扩展大型语言模型
RakutenAI-7B: Extending Large Language Models for Japanese
March 21, 2024
作者: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav, Ting Cai, Wei-Te Chen, Yandi Xia, Yuki Nakayama, Yutaka Higashiyama
cs.AI
摘要
我们介绍了RakutenAI-7B,这是一套面向日本的大型语言模型,它在日语LM Harness基准测试中表现最佳,超过了所有开源的7B模型。除了基础模型外,我们还发布了经过指导和聊天微调的模型,分别是RakutenAI-7B-instruct和RakutenAI-7B-chat,均采用Apache 2.0许可证。
English
We introduce RakutenAI-7B, a suite of Japanese-oriented large language models
that achieve the best performance on the Japanese LM Harness benchmarks among
the open 7B models. Along with the foundation model, we release instruction-
and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat
respectively, under the Apache 2.0 license.Summary
AI-Generated Summary