複数ラベル・複数クラスの意図の共同抽出と検出のためのポインターネットワークベースのアプローチ

要旨

タスク指向型対話システムにおいて、意図検出はユーザーのクエリを解釈し適切な応答を提供するために重要です。既存の研究は主に単一の意図を持つ単純なクエリに焦点を当てており、複数の意図を持つ複雑なクエリを処理し、異なる意図スパンを抽出する効果的なシステムが不足しています。さらに、多言語、多意図のデータセットが著しく不足しています。本研究では、クエリから複数の意図スパンを抽出し、複数の意図を検出し、多言語多ラベルの意図データセットを開発する3つの重要なタスクに取り組んでいます。既存のベンチマークデータセットから収集された新しい多ラベル多クラスの意図検出データセット（MLMCIDデータセット）を紹介しています。また、ポインターネットワークベースのアーキテクチャ（MLMCID）を提案し、セクスタプル形式の粗視化および細分化されたラベルで複数の意図を抽出し検出します。包括的な分析により、当社のポインターネットワークベースのシステムが、様々なデータセットにおいて精度とF1スコアの面でベースライン手法に優越性を示すことが示されています。

English

In task-oriented dialogue systems, intent detection is crucial for interpreting user queries and providing appropriate responses. Existing research primarily addresses simple queries with a single intent, lacking effective systems for handling complex queries with multiple intents and extracting different intent spans. Additionally, there is a notable absence of multilingual, multi-intent datasets. This study addresses three critical tasks: extracting multiple intent spans from queries, detecting multiple intents, and developing a multi-lingual multi-label intent dataset. We introduce a novel multi-label multi-class intent detection dataset (MLMCID-dataset) curated from existing benchmark datasets. We also propose a pointer network-based architecture (MLMCID) to extract intent spans and detect multiple intents with coarse and fine-grained labels in the form of sextuplets. Comprehensive analysis demonstrates the superiority of our pointer network-based system over baseline approaches in terms of accuracy and F1-score across various datasets.

複数ラベル・複数クラスの意図の共同抽出と検出のためのポインターネットワークベースのアプローチ

A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

要旨

Support