대화형 AI를 위한 LLM 기반 사용자 시뮬레이터에서의 목표 정렬

초록

사용자 시뮬레이터는 대화형 AI에서 필수적인 요소로, 시뮬레이션된 상호작용을 통해 확장 가능한 에이전트 개발 및 평가를 가능하게 합니다. 현재의 대형 언어 모델(LLMs)은 사용자 시뮬레이션 능력을 크게 발전시켰지만, 우리는 이러한 모델들이 다중 턴 대화에서 목표 지향적 행동을 일관되게 보여주는 데 어려움을 겪는다는 점을 밝혀냈습니다. 이는 하위 애플리케이션에서의 신뢰성을 저해하는 중요한 한계입니다. 본 연구에서는 사용자 목표 상태 추적(User Goal State Tracking, UGST)이라는 새로운 프레임워크를 소개합니다. 이 프레임워크는 대화 전반에 걸쳐 사용자 목표 진행 상황을 추적합니다. UGST를 활용하여, 우리는 목표 진행 상황을 자율적으로 추적하고 목표에 부합하는 응답을 생성하기 위해 추론할 수 있는 사용자 시뮬레이터 개발을 위한 3단계 방법론을 제시합니다. 또한, 사용자 시뮬레이터의 목표 정렬도를 측정하기 위한 포괄적인 평가 지표를 수립하고, 우리의 접근 방식이 두 벤치마크(MultiWOZ 2.4 및 {\tau}-Bench)에서 상당한 개선을 가져온다는 것을 입증합니다. 본 연구의 기여는 대화형 AI의 중요한 격차를 해소하고, 목표 정렬된 사용자 시뮬레이터 개발을 위한 필수 프레임워크로서 UGST를 확립합니다.

English

User simulators are essential to conversational AI, enabling scalable agent development and evaluation through simulated interactions. While current Large Language Models (LLMs) have advanced user simulation capabilities, we reveal that they struggle to consistently demonstrate goal-oriented behavior across multi-turn conversations--a critical limitation that compromises their reliability in downstream applications. We introduce User Goal State Tracking (UGST), a novel framework that tracks user goal progression throughout conversations. Leveraging UGST, we present a three-stage methodology for developing user simulators that can autonomously track goal progression and reason to generate goal-aligned responses. Moreover, we establish comprehensive evaluation metrics for measuring goal alignment in user simulators, and demonstrate that our approach yields substantial improvements across two benchmarks (MultiWOZ 2.4 and {\tau}-Bench). Our contributions address a critical gap in conversational AI and establish UGST as an essential framework for developing goal-aligned user simulators.

대화형 AI를 위한 LLM 기반 사용자 시뮬레이터에서의 목표 정렬

Goal Alignment in LLM-Based User Simulators for Conversational AI

초록

Support