INTIMA: A Benchmark for Human-AI Companionship Behavior

Abstract

AI companionship, where users develop emotional bonds with AI systems, has emerged as a significant pattern with both positive and concerning implications. We introduce the Interactions and Machine Attachment Benchmark (INTIMA), a benchmark for evaluating companionship behaviors in language models. Drawing on psychological theories and user data, we develop a taxonomy of 31 behaviors across four categories, together with 368 targeted prompts. Responses to these prompts are evaluated as companionship-reinforcing, boundary-maintaining, or neutral. Applying INTIMA to Gemma-3, Phi-4, o3-mini, and Claude-4 reveals that companionship-reinforcing behaviors remain much more common across all models, though we observe marked differences between models. Different commercial providers prioritize different categories within the more sensitive parts of the benchmark, which is concerning because both appropriate boundary-setting and emotional support matter for user well-being. These findings highlight the need for more consistent approaches to handling emotionally charged interactions.