Table-GPT: 다양한 테이블 작업을 위한 테이블 튜닝 GPT

초록

GPT-3.5와 ChatGPT와 같은 언어 모델은 다양한 인간의 지시를 따르고 광범위한 작업을 수행하는 놀라운 능력을 보여줍니다. 그러나 기본적인 테이블 이해 작업을 통해 언어 모델을 탐구해 보면, 오늘날의 언어 모델들이 여전히 테이블 관련 작업에서 최적의 성능을 발휘하지 못하고 있음을 관찰할 수 있습니다. 이는 이들이 주로 1차원의 자연어 텍스트로 사전 학습되었기 때문일 가능성이 높으며, 반면 관계형 테이블은 2차원 객체이기 때문입니다. 이 연구에서 우리는 새로운 "테이블 튜닝" 패러다임을 제안합니다. 이는 GPT-3.5와 ChatGPT와 같은 언어 모델을 실제 테이블에서 합성된 다양한 테이블 작업 데이터를 사용해 계속해서 학습/미세 조정함으로써, 언어 모델의 테이블 이해 능력과 테이블 작업 수행 능력을 향상시키는 것을 목표로 합니다. 우리는 이를 통해 개발된 Table-GPT 모델이 (1) 테이블 이해 능력이 향상되어, GPT-3.5와 ChatGPT를 다양한 테이블 작업(보유된 미확인 작업 포함)에서 일관되게 능가하며, (2) GPT-3.5와 ChatGPT와 유사한 방식으로 새로운 테이블 작업을 수행하기 위한 다양한 인간의 지시에 응답할 수 있는 강력한 일반화 능력을 보여준다는 것을 입증합니다.

English

Language models, such as GPT-3.5 and ChatGPT, demonstrate remarkable abilities to follow diverse human instructions and perform a wide range of tasks. However, when probing language models using a range of basic table-understanding tasks, we observe that today's language models are still sub-optimal in many table-related tasks, likely because they are pre-trained predominantly on one-dimensional natural-language texts, whereas relational tables are two-dimensional objects. In this work, we propose a new "table-tuning" paradigm, where we continue to train/fine-tune language models like GPT-3.5 and ChatGPT, using diverse table-tasks synthesized from real tables as training data, with the goal of enhancing language models' ability to understand tables and perform table tasks. We show that our resulting Table-GPT models demonstrate (1) better table-understanding capabilities, by consistently outperforming the vanilla GPT-3.5 and ChatGPT, on a wide-range of table tasks, including holdout unseen tasks, and (2) strong generalizability, in its ability to respond to diverse human instructions to perform new table-tasks, in a manner similar to GPT-3.5 and ChatGPT.

Table-GPT: 다양한 테이블 작업을 위한 테이블 튜닝 GPT

Table-GPT: Table-tuned GPT for Diverse Table Tasks

초록

Support