MetaMind: メタ認知マルチエージェントシステムによる人間の社会的思考のモデル化

要旨

人間の社会的相互作用は、他者の言葉に表れない意図、感情、信念を推測する能力に依存しており、これは心理学の概念である「心の理論（Theory of Mind, ToM）」に基づく認知スキルである。大規模言語モデル（LLMs）は意味理解タスクにおいて優れているが、人間のコミュニケーションに内在する曖昧さや文脈のニュアンスに対処するのが困難である。このギャップを埋めるため、我々はメタ認知の心理学理論に着想を得たマルチエージェントフレームワーク「MetaMind」を提案し、人間のような社会的推論を模倣する。MetaMindは社会的理解を3つの協調的な段階に分解する：(1) 心の理論エージェントがユーザーの心的状態（意図、感情など）に関する仮説を生成し、(2) ドメインエージェントが文化的規範や倫理的制約を用いてこれらの仮説を精緻化し、(3) レスポンスエージェントが文脈に適した応答を生成しながら、推測された意図との整合性を検証する。本フレームワークは、3つの挑戦的なベンチマークにおいて最先端の性能を達成し、実世界の社会的シナリオで35.7%、ToM推論で6.2%の向上を実現した。特に、LLMsが主要なToMタスクにおいて初めて人間レベルの性能に到達することを可能にした。アブレーションスタディは、すべてのコンポーネントの必要性を確認し、文脈の妥当性、社会的適切性、ユーザー適応のバランスを取るフレームワークの能力を示している。この研究は、共感的な対話や文化的に敏感な相互作用への応用を通じて、AIシステムを人間のような社会的知能に近づけるものである。コードはhttps://github.com/XMZhangAI/MetaMindで公開されている。

English

Human social interactions depend on the ability to infer others' unspoken intentions, emotions, and beliefs-a cognitive skill grounded in the psychological concept of Theory of Mind (ToM). While large language models (LLMs) excel in semantic understanding tasks, they struggle with the ambiguity and contextual nuance inherent in human communication. To bridge this gap, we introduce MetaMind, a multi-agent framework inspired by psychological theories of metacognition, designed to emulate human-like social reasoning. MetaMind decomposes social understanding into three collaborative stages: (1) a Theory-of-Mind Agent generates hypotheses user mental states (e.g., intent, emotion), (2) a Domain Agent refines these hypotheses using cultural norms and ethical constraints, and (3) a Response Agent generates contextually appropriate responses while validating alignment with inferred intent. Our framework achieves state-of-the-art performance across three challenging benchmarks, with 35.7% improvement in real-world social scenarios and 6.2% gain in ToM reasoning. Notably, it enables LLMs to match human-level performance on key ToM tasks for the first time. Ablation studies confirm the necessity of all components, which showcase the framework's ability to balance contextual plausibility, social appropriateness, and user adaptation. This work advances AI systems toward human-like social intelligence, with applications in empathetic dialogue and culturally sensitive interactions. Code is available at https://github.com/XMZhangAI/MetaMind.

MetaMind: メタ認知マルチエージェントシステムによる人間の社会的思考のモデル化

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

要旨

Support