PrefPalette: 潜在属性を用いたパーソナライズド選好モデリング

要旨

AIシステムをパーソナライズするためには、ユーザーの好みだけでなく、その好みの背後にある理由を理解する必要がある。しかし、現在の選好モデルは、人間の判断をブラックボックスとして扱うことが一般的である。本論文では、PrefPaletteというフレームワークを紹介する。このフレームワークは、選好を属性次元に分解し、人間が理解可能な形で異なるソーシャルコミュニティの価値観に合わせて選好予測を調整する。PrefPaletteは、認知科学の原則である多属性意思決定を2つの方法で実践する：(1) 個々の属性効果（例：形式性、ユーモア、文化的価値観）を分離するために合成トレーニングデータを生成するスケーラブルな反事実的属性合成ステップ、(2) 異なるソーシャルコミュニティがこれらの属性を動的に重み付けする方法を学習するアテンションベースの選好モデリング。このアプローチは、集約的な選好モデリングを超えて、人間の判断を駆動する多様な評価フレームワークを捉える。オンラインプラットフォームRedditの45のソーシャルコミュニティで評価した結果、PrefPaletteはGPT-4oを平均予測精度で46.6%上回った。予測精度の向上に加えて、PrefPaletteは直感的でコミュニティ固有のプロファイルを明らかにした：学術的なコミュニティは詳細さと刺激を重視し、対立志向のコミュニティは皮肉と直接性を価値とし、支援ベースのコミュニティは共感を強調する。人間の判断の属性媒介構造をモデル化することで、PrefPaletteは優れた選好モデリングと透明で解釈可能な洞察を提供し、より信頼性が高く価値認識型のパーソナライズドアプリケーションに向けた第一歩となる。

English

Personalizing AI systems requires understanding not just what users prefer, but the reasons that underlie those preferences - yet current preference models typically treat human judgment as a black box. We introduce PrefPalette, a framework that decomposes preferences into attribute dimensions and tailors its preference prediction to distinct social community values in a human-interpretable manner. PrefPalette operationalizes a cognitive science principle known as multi-attribute decision making in two ways: (1) a scalable counterfactual attribute synthesis step that involves generating synthetic training data to isolate for individual attribute effects (e.g., formality, humor, cultural values), and (2) attention-based preference modeling that learns how different social communities dynamically weight these attributes. This approach moves beyond aggregate preference modeling to capture the diverse evaluation frameworks that drive human judgment. When evaluated on 45 social communities from the online platform Reddit, PrefPalette outperforms GPT-4o by 46.6% in average prediction accuracy. Beyond raw predictive improvements, PrefPalette also shed light on intuitive, community-specific profiles: scholarly communities prioritize verbosity and stimulation, conflict-oriented communities value sarcasm and directness, and support-based communities emphasize empathy. By modeling the attribute-mediated structure of human judgment, PrefPalette delivers both superior preference modeling and transparent, interpretable insights, and serves as a first step toward more trustworthy, value-aware personalized applications.

PrefPalette: 潜在属性を用いたパーソナライズド選好モデリング

PrefPalette: Personalized Preference Modeling with Latent Attributes

要旨

Support