予測と学習：プロアクティブエージェントにおけるアイドル時間の計算資源の解放

要旨

AIエージェントは推論やツール使用において顕著な能力を示しているものの、基本的には依然として受動的であり、明示的なユーザープロンプトを受けて初めて応答を計算する。このパラダイムは重要な機会を見過ごしている。すなわち、インタラクション間のアイドル時間はほとんど無駄になり、エージェントは将来のユーザーニーズに備えることができない。このギャップを埋めるため、我々はProActを導入する。これは、アイドル時間の計算を活用して、発生し得る将来のユーザーニーズを予測し充足するプロアクティブなエージェントアーキテクチャである。進化する対話履歴と永続的なメモリを併せて分析することで、ProActは今後のニーズを予測し、情報を反復的に取得し、知識のギャップを解消し、ユーザーがクエリを開始する前に証拠を準備することを可能にする。プロアクティブな能力を厳密に評価するため、我々はProActEvalも導入する。これは、予測可能なニーズ連鎖と多様なユーザーの認知プロファイルを備えた、40分野にわたる200シナリオからなる包括的なベンチマークである。実験結果は、受動的なベースラインに対して有意な優位性を示している。ProActEvalにおいて、ProActは必要なターン数を14.8%削減し、ユーザー負担を11.7%軽減し、幻覚率を28.1%削減することで、タスク完了を加速する。さらに、MemBench評価により、ProActが最先端の内省精度を達成し、その持続的かつ堅牢な性能が実証されている。

English

While AI agents demonstrate remarkable capabilities in reasoning and tool use, they remain fundamentally reactive: they compute responses only after explicit user prompts. This paradigm ignores a critical opportunity: the idle time between interactions is largely wasted, leaving agents unable to prepare for future user needs. To bridge this gap, we introduce ProAct, a proactive agent architecture that leverages idle-time compute to anticipate and fulfill likely upcoming user needs. By analyzing evolving dialogue history together with persistent memory, ProAct predicts upcoming needs and iteratively acquires information, allowing the agent to resolve knowledge gaps and prepare evidence before the user initiates a query.To rigorously evaluate proactive capabilities, we also introduce ProActEval, a comprehensive benchmark comprising 200 scenarios across 40 domains, featuring predictable need chains and diverse user cognitive profiles. Empirical results demonstrate significant advantages over reactive baselines. ProAct accelerates task completion by reducing required turns by 14.8%, decreases user effort by 11.7%, and cuts hallucination rates by 28.1% on ProActEval. Furthermore, MemBench evaluations confirm that ProAct achieves state-of-the-art reflective accuracy, underscoring its sustained and robust performance.