Medindo Valores Humanos e de IA com base em Psicometria Generativa com Modelos de Linguagem Grandes

Resumo

Os valores humanos e sua medição são objeto de investigação interdisciplinar de longa data. Avanços recentes em IA reacenderam o interesse nessa área, com os grandes modelos de linguagem (LLMs) surgindo tanto como ferramentas quanto como objetos de medição de valores. Este trabalho apresenta a Psicometria Generativa de Valores (GPV), um paradigma de medição de valores baseado em LLM, orientado teoricamente em percepções seletivas reveladas por texto. Começamos ajustando um LLM para medição precisa de valores ao nível da percepção e verificando a capacidade dos LLMs de analisar textos em percepções, formando o núcleo do pipeline da GPV. Aplicando a GPV a blogs escritos por humanos, demonstramos sua estabilidade, validade e superioridade em relação a ferramentas psicológicas anteriores. Em seguida, estendendo a GPV para a medição de valores por LLM, avançamos na arte atual com 1) uma metodologia psicométrica que mede valores de LLM com base em suas saídas escaláveis e de formato livre, permitindo medições específicas de contexto; 2) uma análise comparativa de paradigmas de medição, indicando vieses de resposta de métodos anteriores; e 3) uma tentativa de conectar valores de LLM e sua segurança, revelando o poder preditivo de diferentes sistemas de valores e os impactos de diversos valores na segurança de LLM. Através de esforços interdisciplinares, buscamos aproveitar a IA para psicometria de próxima geração e psicometria para IA alinhada com valores.

English

Human values and their measurement are long-standing interdisciplinary inquiry. Recent advances in AI have sparked renewed interest in this area, with large language models (LLMs) emerging as both tools and subjects of value measurement. This work introduces Generative Psychometrics for Values (GPV), an LLM-based, data-driven value measurement paradigm, theoretically grounded in text-revealed selective perceptions. We begin by fine-tuning an LLM for accurate perception-level value measurement and verifying the capability of LLMs to parse texts into perceptions, forming the core of the GPV pipeline. Applying GPV to human-authored blogs, we demonstrate its stability, validity, and superiority over prior psychological tools. Then, extending GPV to LLM value measurement, we advance the current art with 1) a psychometric methodology that measures LLM values based on their scalable and free-form outputs, enabling context-specific measurement; 2) a comparative analysis of measurement paradigms, indicating response biases of prior methods; and 3) an attempt to bridge LLM values and their safety, revealing the predictive power of different value systems and the impacts of various values on LLM safety. Through interdisciplinary efforts, we aim to leverage AI for next-generation psychometrics and psychometrics for value-aligned AI.

Medindo Valores Humanos e de IA com base em Psicometria Generativa com Modelos de Linguagem Grandes

Measuring Human and AI Values based on Generative Psychometrics with Large Language Models

Resumo

Support