RobotValues: Het evalueren van huishoudrobots bij conflicterende menselijke waarden

Samenvatting

Hoewel huishoudrobots vaak worden geëvalueerd op basis van taakvoltooiing, komen in alledaagse huishoudelijke omgevingen situaties met conflicterende waarden voor waarin van robots wordt verwacht dat zij acties kiezen die andere waarden dan taaksucces prioriteren, zoals menselijke autonomie, efficiëntie of sociale gepastheid. Toch bestaan er geen benchmarks om de waardevoorkeuren van robots in dergelijke scenario's te evalueren. Wij introduceren RobotValues, een benchmark om planners van huishoudrobots te evalueren in 10.000 waardegeschillenscenario's. Elk voorbeeld bestaat uit een realistische huishoudelijke afbeelding met meerdere plausibele robotacties die verschillende menselijke waarden prioriteren. We construeren RobotValues door middel van LLM-ondersteunde scenariogeneratie, belanghebbende-gefundeerde waarde-extractie, beeldgeneratie en automatische kwaliteitscontrole. Met RobotValues evalueren we in de robotica gebruikte VLMs en ontdekken dat modellen standaard waardevoorkeuren vertonen, waaronder veiligheid en accommodatie, terwijl zij minder vaak kiezen voor privacy prioriterende acties. Wanneer de modellen worden geïnstrueerd om specifieke waarden te prioriteren die conflicteren met hun eigen voorkeuren, slagen zij er vaak niet in hun standaardacties te overschrijven, en kiezen zij in 80% van de keren een verkeerde actie. Deze bevindingen suggereren dat evaluatie van huishoudrobots niet alleen taakvoltooiing of veiligheidsnaleving moet meten, maar ook of robots kunnen kiezen tussen plausibele acties wanneer menselijke waarden conflicteren.

English

While household robots are often evaluated based on task completion, everyday domestic environments involve value-conflicting situations in which robots are expected to choose actions that prioritize other values than task success, such as human autonomy, efficiency, or social appropriateness. Yet, there are no benchmarks for evaluating robots' value preferences in such scenarios. We introduce RobotValues, a benchmark to evaluate household robot planners in 10K value-conflict scenarios. Each instance consists of a realistic household image with multiple plausible robot actions that prioritize different human values. We construct RobotValues through LLM-assisted scenario generation, stakeholder-grounded value extraction, image generation and automatic quality control. Using RobotValues we evaluate VLMs used in robotics and find that models exhibit default value preferences, including safety and accommodation, while underselecting privacy-prioritizing actions. When the models are instructed to prioritize specific values that conflict with their own preferences, they often fail to override their default actions, choosing incorrect actions for 80% of the time. These findings suggest that household robot evaluation should measure not only task completion or safety compliance, but also whether robots can choose among plausible actions when human values conflict.