RoundTable: 動的スキーマとコンテクスト自動補完を活用した、表形式の質問応答におけるクエリ精度向上

要旨

大規模言語モデル（LLM）の進歩により、登場した主要なユースケースの1つは、プレーンな英語でデータベースにクエリを投げることであり、ユーザーの質問を実行可能なデータベースクエリに変換することが大幅に改善されました。ただし、実世界のデータセットには多くの属性や複雑な値が含まれており、LLMが自然言語クエリから関連する列や値を正確に特定するタスクが複雑化しています。従来の方法ではデータセットの規模や複雑さをLLMに完全に伝えることができません。これらの課題に対処するため、入力テーブルでのフルテキスト検索（FTS）を活用する新しいフレームワークを提案します。このアプローチは特定の値や列を正確に検出するだけでなく、言語モデルの検索空間を狭めることでクエリの精度を向上させます。さらに、テーブル内のデータに基づいてクエリを提案するカスタムオートコンプリート機能をサポートしています。この統合により、ユーザーと複雑なデータセットとのインタラクションが大幅に改善され、現在のテーブルクエリ機能が直面している制約に対する洗練されたソリューションが提供されます。この研究には、MacとWindowsプラットフォームの両方に対応したアプリケーションも付属しており、読者は自分自身のデータで試すことができます。

English

With advancements in Large Language Models (LLMs), a major use case that has emerged is querying databases in plain English, translating user questions into executable database queries, which has improved significantly. However, real-world datasets often feature a vast array of attributes and complex values, complicating the LLMs task of accurately identifying relevant columns or values from natural language queries. Traditional methods cannot fully relay the datasets size and complexity to the LLM. To address these challenges, we propose a novel framework that leverages Full-Text Search (FTS) on the input table. This approach not only enables precise detection of specific values and columns but also narrows the search space for language models, thereby enhancing query accuracy. Additionally, it supports a custom auto-complete feature that suggests queries based on the data in the table. This integration significantly refines the interaction between the user and complex datasets, offering a sophisticated solution to the limitations faced by current table querying capabilities. This work is accompanied by an application for both Mac and Windows platforms, which readers can try out themselves on their own data.

RoundTable: 動的スキーマとコンテクスト自動補完を活用した、表形式の質問応答におけるクエリ精度向上

RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering

要旨

Support