GPT可以在没有计算器的情况下解决数学问题。
GPT Can Solve Mathematical Problems Without a Calculator
September 6, 2023
作者: Zhen Yang, Ming Ding, Qingsong Lv, Zhihuan Jiang, Zehai He, Yuyi Guo, Jinfeng Bai, Jie Tang
cs.AI
摘要
先前的研究通常假设大型语言模型无法准确执行算术运算,特别是超过8位数字的乘法,以及涉及小数和分数的运算,而无需使用计算器工具。本文旨在挑战这一误解。通过充分的训练数据,一个拥有20亿参数的语言模型可以在几乎100%的准确率下执行多位数的算术运算,而无数据泄漏,明显超过了GPT-4(其多位数乘法准确率仅为4.3%)。我们还展示了我们的MathGLM,从GLM-10B微调而来,使用包含额外多步算术运算和文本描述的数学问题数据集,在一个包含5,000个样本的中文数学问题测试集上实现了与GPT-4类似的性能。
English
Previous studies have typically assumed that large language models are unable
to accurately perform arithmetic operations, particularly multiplication of >8
digits, and operations involving decimals and fractions, without the use of
calculator tools. This paper aims to challenge this misconception. With
sufficient training data, a 2 billion-parameter language model can accurately
perform multi-digit arithmetic operations with almost 100% accuracy without
data leakage, significantly surpassing GPT-4 (whose multi-digit multiplication
accuracy is only 4.3%). We also demonstrate that our MathGLM, fine-tuned from
GLM-10B on a dataset with additional multi-step arithmetic operations and math
problems described in text, achieves similar performance to GPT-4 on a
5,000-samples Chinese math problem test set.