Macaron-A2UI: 개인 에이전트를 위한 생성형 UI 모델

초록

개인 에이전트가 복잡하고 사용자 중심적인 작업을 처리하도록 진화함에 따라, 정적 일반 텍스트 채팅이 급속히 병목 현상으로 작용하고 있다. 생성형 UI가 필요한 새로운 인터페이스 계층으로 부상하며, 상호작용 맥락으로부터 적절한 컨트롤, 옵션 및 상태를 실시간으로 동적으로 합성한다. 본 논문에서는 개인 에이전트를 위한 생성형 UI 모델인 Macaron-A2UI를 제시한다. 목표는 에이전트가 정보 수집, 선호도 정제, 확인 및 다중 목표 조직을 위한 가볍고 실행 가능한 UI 동작과 함께 자연어를 생성할 수 있도록 함으로써 텍스트 전용 상호작용을 넘어서는 것이다. 다양한 이종 대화 소스로부터 대규모 생성형 UI 코퍼스를 구축하고, 통제된 평가를 위한 A2UI-Bench를 도입하며, 파라미터 효율적인 LoRA 기반 지도 미세 조정 후 보상 기반 강화 학습을 통해 30B, 235B 및 754B 모델을 훈련한다. 최고 성능의 Macaron-A2UI 모델은 명시적 스키마 힌트 없이 A2UI-Bench에서 전체 75.6점을 달성하여, 가장 강력한 전체 스키마 최첨단 기준선을 능가한다. 본 연구팀은 모델, 벤치마크 및 평가 프로토콜을 공개하여 개인 에이전트를 위한 생성형 UI 관련 후속 연구를 지원한다.

English

As personal agents evolve to handle complex, user-centric tasks, static plain-text chat is rapidly becoming a bottleneck. Generative UI emerges as the necessary new interface layer, dynamically synthesizing the right controls, options, and state from the interaction context in real time. We present Macaron-A2UI, a model for Generative UI in personal agents. Our goal is to move beyond text-only interaction by enabling agents to generate natural language together with lightweight, executable UI actions for information collection, preference refinement, confirmation, and multi-goal organization. We build a large-scale Generative UI corpus from heterogeneous dialogue sources, introduce A2UI-Bench for controlled evaluation, and train 30B, 235B and 754B models with parameter-efficient LoRA-based supervised fine-tuning followed by reward-driven reinforcement learning. The best Macaron-A2UI model reaches 75.6 overall on A2UI-Bench without explicit schema hints, surpassing the strongest full-schema frontier baseline. We release the models, benchmark, and evaluation protocol to support future work on Generative UI for personal agents.