使用GPT-4進行具挑戰性數學問題解決的實證研究

摘要

利用大型語言模型（LLMs）來解決數學問題是一個引人入勝的研究領域，考慮到在眾多科學和工程領域中以自然語言表達的豐富數學問題。雖然先前有幾項研究探討使用LLMs解決基礎數學問題，但本研究探索了使用GPT-4解決更複雜和具有挑戰性的數學問題的前沿。我們評估了多種使用GPT-4的方法。其中一些是從現有工作中改編而來，另一個是\MathChat，這是本研究中新提出的一個對話式問題解決框架。我們在MATH數據集中的困難高中競賽問題上進行評估，顯示了所提出的對話式方法的優勢。

English

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields. While several prior works have investigated solving elementary mathematics using LLMs, this work explores the frontier of using GPT-4 for solving more complex and challenging math problems. We evaluate various ways of using GPT-4. Some of them are adapted from existing work, and one is \MathChat, a conversational problem-solving framework newly proposed in this work. We perform the evaluation on difficult high school competition problems from the MATH dataset, which shows the advantage of the proposed conversational approach.

使用GPT-4進行具挑戰性數學問題解決的實證研究

An Empirical Study on Challenging Math Problem Solving with GPT-4

摘要

Support