敵対的データ収集：効率的かつ堅牢なロボット模倣学習のための人間協調的摂動

要旨

データ効率の追求、すなわち量よりも質を重視するアプローチは、特に現実世界でのデータ収集に伴う高コストを考慮すると、ロボット操作における重要な基盤として浮上しています。我々は、個々のデモンストレーションの情報密度を最大化することが、大規模なデータセットへの依存を劇的に減らしつつ、タスク性能を向上させると提案します。この目的のために、我々はAdversarial Data Collection（ADC）を導入します。これは、リアルタイムの双方向的人間-環境相互作用を通じてロボットデータ収集を再定義するHuman-in-the-Loop（HiL）フレームワークです。静的なデモンストレーションを受動的に記録する従来のパイプラインとは異なり、ADCは協調的摂動パラダイムを採用します：単一のエピソード中に、敵対的オペレーターが物体の状態、環境条件、および言語命令を動的に変更し、テレオペレーターはこれらの変化する課題を克服するために適応的に行動を調整します。このプロセスにより、多様な失敗回復行動、合成的タスク変異、および環境摂動が最小限のデモンストレーションに圧縮されます。我々の実験では、ADCで訓練されたモデルが、未見のタスク指示に対する優れた合成的汎化能力、知覚的摂動に対する強化された頑健性、および新たなエラー回復能力を達成することを示しています。驚くべきことに、ADCを通じて収集されたデモンストレーション量のわずか20%で訓練されたモデルは、完全なデータセットを使用する従来のアプローチを大幅に上回ります。これらの進展は、データ中心の学習パラダイムと実用的なロボット展開の間のギャップを埋め、戦略的なデータ収集が、単なる事後処理ではなく、スケーラブルな現実世界のロボット学習にとって重要であることを示しています。さらに、我々は、敵対的摂動を伴う現実世界の操作タスクを含む大規模なADC-Roboticsデータセットをキュレーションしています。このベンチマークは、ロボット模倣学習の進展を促進するためにオープンソース化されます。

English

The pursuit of data efficiency, where quality outweighs quantity, has emerged as a cornerstone in robotic manipulation, especially given the high costs associated with real-world data collection. We propose that maximizing the informational density of individual demonstrations can dramatically reduce reliance on large-scale datasets while improving task performance. To this end, we introduce Adversarial Data Collection, a Human-in-the-Loop (HiL) framework that redefines robotic data acquisition through real-time, bidirectional human-environment interactions. Unlike conventional pipelines that passively record static demonstrations, ADC adopts a collaborative perturbation paradigm: during a single episode, an adversarial operator dynamically alters object states, environmental conditions, and linguistic commands, while the tele-operator adaptively adjusts actions to overcome these evolving challenges. This process compresses diverse failure-recovery behaviors, compositional task variations, and environmental perturbations into minimal demonstrations. Our experiments demonstrate that ADC-trained models achieve superior compositional generalization to unseen task instructions, enhanced robustness to perceptual perturbations, and emergent error recovery capabilities. Strikingly, models trained with merely 20% of the demonstration volume collected through ADC significantly outperform traditional approaches using full datasets. These advances bridge the gap between data-centric learning paradigms and practical robotic deployment, demonstrating that strategic data acquisition, not merely post-hoc processing, is critical for scalable, real-world robot learning. Additionally, we are curating a large-scale ADC-Robotics dataset comprising real-world manipulation tasks with adversarial perturbations. This benchmark will be open-sourced to facilitate advancements in robotic imitation learning.

敵対的データ収集：効率的かつ堅牢なロボット模倣学習のための人間協調的摂動

Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning

要旨

Support