H2O开放生态系统用于最先进的大型语言模型
H2O Open Ecosystem for State-of-the-art Large Language Models
October 17, 2023
作者: Arno Candel, Jon McKinney, Philipp Singer, Pascal Pfeiffer, Maximilian Jeblick, Chun Ming Lee, Marcos V. Conde
cs.AI
摘要
大型语言模型(LLMs)代表了人工智能领域的一场革命。然而,它们也带来了许多重大风险,比如存在偏见、私密、受版权保护或有害文本。因此,我们需要开放、透明和安全的解决方案。我们推出了一个完整的开源生态系统,用于开发和测试LLMs。该项目的目标是推动开放替代方案取代闭源方法。我们发布了h2oGPT,这是一个包含70亿个参数的经过精细调整的LLMs系列。我们还推出了H2O LLM Studio,这是一个框架和无代码图形用户界面,旨在使用最新的尖端技术高效进行LLMs的精细调整、评估和部署。我们的代码和模型使用完全宽松的Apache 2.0许可证。我们相信开源语言模型有助于推动人工智能的发展,并使其更加易于获取和可信赖。演示可在以下网址查看:https://gpt.h2o.ai/
English
Large Language Models (LLMs) represent a revolution in AI. However, they also
pose many significant risks, such as the presence of biased, private,
copyrighted or harmful text. For this reason we need open, transparent and safe
solutions. We introduce a complete open-source ecosystem for developing and
testing LLMs. The goal of this project is to boost open alternatives to
closed-source approaches. We release h2oGPT, a family of fine-tuned LLMs from 7
to 70 Billion parameters. We also introduce H2O LLM Studio, a framework and
no-code GUI designed for efficient fine-tuning, evaluation, and deployment of
LLMs using the most recent state-of-the-art techniques. Our code and models are
licensed under fully permissive Apache 2.0 licenses. We believe open-source
language models help to boost AI development and make it more accessible and
trustworthy. The demo is available at: https://gpt.h2o.ai/