複雑な発話の自然言語分解と解釈

要旨

自然言語インターフェースは、ユーザーの要求をプログラム、データベースクエリ、またはその他の構造化された意図表現に変換するために、しばしば教師付きデータを必要とします。データ収集の際に、ユーザーのニーズの全範囲を予測し形式化することは困難です。例えば、シンプルな要求（「明日の会議を見つけて」や「マネージャーとの会議を正午に移動して」など）を処理するように設計されたシステムにおいても、ユーザーはより複雑な要求（「月曜日と火曜日のすべての電話を入れ替えて」など）を表現する場合があります。本論文では、階層的な自然言語分解プロセスを通じて、シンプルな言語からコードへのモデルが複雑な発話を処理できるようにするアプローチを紹介します。このアプローチでは、事前学習済みの言語モデルを使用して複雑な発話をより小さな自然言語ステップのシーケンスに分解し、各ステップを言語からコードへのモデルで解釈します。このアプローチを検証するために、複雑な発話の分解（Decomposition of Complex Utterances: DeCU）を評価する新しいNL-to-programベンチマークを収集し公開します。実験結果は、提案されたアプローチが複雑な訓練データをほとんど必要とせずに複雑な発話を解釈できることを示し、標準的なfew-shotプロンプティングアプローチを上回る性能を発揮します。

English

Natural language interfaces often require supervised data to translate user requests into programs, database queries, or other structured intent representations. During data collection, it can be difficult to anticipate and formalize the full range of user needs -- for example, in a system designed to handle simple requests (like find my meetings tomorrow or move my meeting with my manager to noon), users may also express more elaborate requests (like swap all my calls on Monday and Tuesday). We introduce an approach for equipping a simple language-to-code model to handle complex utterances via a process of hierarchical natural language decomposition. Our approach uses a pre-trained language model to decompose a complex utterance into a sequence of smaller natural language steps, then interprets each step using the language-to-code model. To test our approach, we collect and release DeCU -- a new NL-to-program benchmark to evaluate Decomposition of Complex Utterances. Experiments show that the proposed approach enables the interpretation of complex utterances with almost no complex training data, while outperforming standard few-shot prompting approaches.

複雑な発話の自然言語分解と解釈

Natural Language Decomposition and Interpretation of Complex Utterances

要旨

Support