Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

Abstract

Recently, the release of INSTRUCTEVAL has provided valuable insights into the performance of large language models (LLMs) that use encoder-decoder or decoder-only architectures. Interestingly, despite being introduced four years ago, T5-based LLMs such as FLAN-T5 continue to outperform the latest decoder-based LLMs, such as LLAMA and VICUNA, on tasks that require general problem-solving skills. This performance discrepancy can be attributed to three key factors: (1) pre-training data, (2) backbone architecture, and (3) instruction dataset. In this technical report, we focus on the impact of the third factor, using VICUNA, a LLAMA-based large language model that has been fine-tuned on ChatGPT conversations. To this end, we fine-tuned VICUNA on a customized collection of instruction datasets called FLANMINI. The collection includes a subset of the large-scale instruction dataset FLAN, as well as various code-related datasets and conversational datasets derived from ChatGPT/GPT-4, and it comprises a large number of tasks that demand problem-solving skills. Our experimental findings strongly indicate that the enhanced problem-solving abilities of our model, FLACUNA, are obtained through fine-tuning VICUNA on the FLAN dataset, leading to significant improvements across numerous benchmark datasets in INSTRUCTEVAL. FLACUNA is publicly available at https://huggingface.co/declare-lab/flacuna-13b-v1.0.
