SEAD: 다중 턴 서비스 대화를 위한 자기 진화 에이전트

초록

대규모 언어 모델은 오픈 도메인 대화에서 뛰어난 능력을 입증해왔습니다. 그러나 서비스 대화 영역에서는 현재 방법론들이 노이즈가 많고 저품질의 인간 대화 데이터에 의존함에 따라 최적의 성능을 발휘하지 못하고 있습니다. 이러한 한계는 데이터 부족과 실제 목표 지향적 사용자 행동을 시뮬레이션하는 데 따른 어려움에서 기인합니다. 이러한 문제를 해결하기 위해 우리는 대규모 인간 주석 없이도 에이전트가 효과적인 전략을 학습할 수 있는 프레임워크인 SEAD(Self-Evolving Agent for Service Dialogue)를 제안합니다. SEAD는 사용자 모델링을 두 가지 구성 요소로 분리합니다: 훈련 커리큘럼을 관리하기 위해 다양한 사용자 상태를 생성하는 프로필 컨트롤러와 현실적인 역할 수행에 중점을 두는 사용자 역할 수행 모델입니다. 이러한 설계는 환경이 불공정한 적대자가 아닌 적응형 훈련 시나리오를 제공하도록 보장합니다. 실험 결과, SEAD가 오픈소스 기반 모델과 클로즈드소스 상용 모델을 크게 능가하며, 작업 완료율을 17.6%, 대화 효율성을 11.1% 향상시키는 것으로 나타났습니다. 코드는 https://github.com/Da1yuqin/SEAD에서 확인할 수 있습니다.

English

Large Language Models have demonstrated remarkable capabilities in open-domain dialogues. However, current methods exhibit suboptimal performance in service dialogues, as they rely on noisy, low-quality human conversation data. This limitation arises from data scarcity and the difficulty of simulating authentic, goal-oriented user behaviors. To address these issues, we propose SEAD (Self-Evolving Agent for Service Dialogue), a framework that enables agents to learn effective strategies without large-scale human annotations. SEAD decouples user modeling into two components: a Profile Controller that generates diverse user states to manage training curriculum, and a User Role-play Model that focuses on realistic role-playing. This design ensures the environment provides adaptive training scenarios rather than acting as an unfair adversary. Experiments demonstrate that SEAD significantly outperforms Open-source Foundation Models and Closed-source Commercial Models, improving task completion rate by 17.6% and dialogue efficiency by 11.1%. Code is available at: https://github.com/Da1yuqin/SEAD.

SEAD: 다중 턴 서비스 대화를 위한 자기 진화 에이전트

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue

초록

Support