BEDA: 전략적 대화 행위 수행을 위한 확률적 제약 조건으로서의 신념 추정

초록

전략적 대화를 위해서는 에이전트가 서로 다른 대화 행위를 수행해야 하며, 이를 위해 믿음 추정이 필수적입니다. 기존 연구들은 믿음을 정확히 추정하는 경우가 많았지만, 생성 과정에서 그러한 믿음을 활용하는 원리적인 메커니즘이 부족했습니다. 우리는 이 격차를 해소하기 위해 먼저 적대적 행위와 조정 행위라는 두 가지 핵심 행위를 공식화하고, 에이전트가 생성할 수 있는 내용에 대한 확률적 제약을 통해 이를 운영화했습니다. 우리는 이러한 아이디어를 BEDA 프레임워크에 구현했으며, 이는 세계 집합, 믿음 추정을 위한 믿음 추정기, 그리고 추론된 믿음과 일관된 행위를 선택하고 발화를 실현하는 조건부 생성기로 구성됩니다. 조건형 키퍼-강도(CKBG, 적대적), 상호 친구(MF, 협력적), CaSiNo(협상)라는 세 가지 설정에서 BEDA는 강력한 기준 모델들을 일관되게 능가했습니다: CKBG에서는 백본 모델별로 성공률을 최소 5.0점, GPT-4.1-nano를 사용할 때는 20.6점 향상시켰으며; Mutual Friends에서는 평균 9.3점의 향상을 달성했고; CaSiNo에서는 모든 기준 모델 대비 최적의 거래를 달성했습니다. 이러한 결과는 믿음 추정을 제약 조건으로 설정하는 것이 신뢰할 수 있는 전략적 대화를 위한 단순하면서도 일반적인 메커니즘을 제공함을 시사합니다.

English

Strategic dialogue requires agents to execute distinct dialogue acts, for which belief estimation is essential. While prior work often estimates beliefs accurately, it lacks a principled mechanism to use those beliefs during generation. We bridge this gap by first formalizing two core acts Adversarial and Alignment, and by operationalizing them via probabilistic constraints on what an agent may generate. We instantiate this idea in BEDA, a framework that consists of the world set, the belief estimator for belief estimation, and the conditional generator that selects acts and realizes utterances consistent with the inferred beliefs. Across three settings, Conditional Keeper Burglar (CKBG, adversarial), Mutual Friends (MF, cooperative), and CaSiNo (negotiation), BEDA consistently outperforms strong baselines: on CKBG it improves success rate by at least 5.0 points across backbones and by 20.6 points with GPT-4.1-nano; on Mutual Friends it achieves an average improvement of 9.3 points; and on CaSiNo it achieves the optimal deal relative to all baselines. These results indicate that casting belief estimation as constraints provides a simple, general mechanism for reliable strategic dialogue.

BEDA: 전략적 대화 행위 수행을 위한 확률적 제약 조건으로서의 신념 추정

BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts

초록

Support