利用大型语言模型在YAML中自动生成信息技术任务代码
Automated Code generation for Information Technology Tasks in YAML through Large Language Models
May 2, 2023
作者: Saurabh Pujar, Luca Buratti, Xiaojie Guo, Nicolas Dupuis, Burn Lewis, Sahil Suneja, Atin Sood, Ganesh Nalawade, Matt Jones, Alessandro Morari, Ruchir Puri
cs.AI
摘要
最近,由于大型语言模型的使用,代码生成能力得到了显著改善,主要受益于通用编程语言。领域特定语言,比如用于IT自动化的语言,尽管涉及许多活跃开发人员并且是现代云平台的重要组成部分,但受到的关注却较少。本研究侧重于Ansible-YAML的生成,这是一种广泛用于IT自动化的标记语言。我们提出了Ansible Wisdom,这是一个旨在提高IT自动化生产力的自然语言到Ansible-YAML代码生成工具。Ansible Wisdom是一个基于Transformer的模型,通过使用包含Ansible-YAML的新数据集进行训练进行了扩展。我们还开发了两个针对YAML和Ansible的新颖性能指标,以捕捉该领域的特定特征。结果表明,Ansible Wisdom能够准确地从自然语言提示中生成Ansible脚本,其性能与现有最先进的代码生成模型相当或更好。
English
The recent improvement in code generation capabilities due to the use of
large language models has mainly benefited general purpose programming
languages. Domain specific languages, such as the ones used for IT Automation,
have received far less attention, despite involving many active developers and
being an essential component of modern cloud platforms. This work focuses on
the generation of Ansible-YAML, a widely used markup language for IT
Automation. We present Ansible Wisdom, a natural-language to Ansible-YAML code
generation tool, aimed at improving IT automation productivity. Ansible Wisdom
is a transformer-based model, extended by training with a new dataset
containing Ansible-YAML. We also develop two novel performance metrics for YAML
and Ansible to capture the specific characteristics of this domain. Results
show that Ansible Wisdom can accurately generate Ansible script from natural
language prompts with performance comparable or better than existing state of
the art code generation models.