Adversariale Datacollectie: Mens-gecollaboreerde Perturbaties voor Efficiënt en Robuust Imitatieleren in Robotica

Samenvatting

De zoektocht naar data-efficiëntie, waarbij kwaliteit belangrijker is dan kwantiteit, is naar voren gekomen als een hoeksteen in robotmanipulatie, vooral gezien de hoge kosten die gepaard gaan met het verzamelen van real-world data. Wij stellen voor dat het maximaliseren van de informatie-dichtheid van individuele demonstraties de afhankelijkheid van grootschalige datasets aanzienlijk kan verminderen, terwijl de taakprestaties verbeteren. Hiertoe introduceren we Adversarial Data Collection (ADC), een Human-in-the-Loop (HiL) raamwerk dat robotische data-acquisitie herdefinieert door middel van real-time, bidirectionele mens-omgeving interacties. In tegenstelling tot conventionele pipelines die statische demonstraties passief opnemen, neemt ADC een collaboratief verstoringsparadigma aan: tijdens een enkele episode verandert een adversariale operator dynamisch objecttoestanden, omgevingscondities en linguïstische commando's, terwijl de tele-operator adaptief acties aanpast om deze evoluerende uitdagingen te overwinnen. Dit proces comprimeert diverse fout-herstelgedragingen, compositionele taakvariaties en omgevingsverstoringen in minimale demonstraties. Onze experimenten tonen aan dat met ADC getrainde modellen superieure compositionele generalisatie naar onbekende taakinstructies bereiken, verbeterde robuustheid tegen perceptuele verstoringen en emergent foutherstelvermogen. Opvallend is dat modellen die getraind zijn met slechts 20% van het demonstratievolume dat via ADC is verzameld, traditionele benaderingen die volledige datasets gebruiken, significant overtreffen. Deze vooruitgang overbrugt de kloof tussen data-centrische leerparadigma's en praktische robotimplementatie, en toont aan dat strategische data-acquisitie, niet slechts post-hoc verwerking, cruciaal is voor schaalbare, real-world robotleren. Daarnaast zijn we bezig met het samenstellen van een grootschalige ADC-Robotics dataset die real-world manipulatietaken met adversariale verstoringen omvat. Deze benchmark zal open-source worden gemaakt om vooruitgang in robotische imitatieleren te faciliteren.

English

The pursuit of data efficiency, where quality outweighs quantity, has emerged as a cornerstone in robotic manipulation, especially given the high costs associated with real-world data collection. We propose that maximizing the informational density of individual demonstrations can dramatically reduce reliance on large-scale datasets while improving task performance. To this end, we introduce Adversarial Data Collection, a Human-in-the-Loop (HiL) framework that redefines robotic data acquisition through real-time, bidirectional human-environment interactions. Unlike conventional pipelines that passively record static demonstrations, ADC adopts a collaborative perturbation paradigm: during a single episode, an adversarial operator dynamically alters object states, environmental conditions, and linguistic commands, while the tele-operator adaptively adjusts actions to overcome these evolving challenges. This process compresses diverse failure-recovery behaviors, compositional task variations, and environmental perturbations into minimal demonstrations. Our experiments demonstrate that ADC-trained models achieve superior compositional generalization to unseen task instructions, enhanced robustness to perceptual perturbations, and emergent error recovery capabilities. Strikingly, models trained with merely 20% of the demonstration volume collected through ADC significantly outperform traditional approaches using full datasets. These advances bridge the gap between data-centric learning paradigms and practical robotic deployment, demonstrating that strategic data acquisition, not merely post-hoc processing, is critical for scalable, real-world robot learning. Additionally, we are curating a large-scale ADC-Robotics dataset comprising real-world manipulation tasks with adversarial perturbations. This benchmark will be open-sourced to facilitate advancements in robotic imitation learning.

Adversariale Datacollectie: Mens-gecollaboreerde Perturbaties voor Efficiënt en Robuust Imitatieleren in Robotica

Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning

Samenvatting

Support