Dataset Collectors API#
MIKASA-Robo-VLA provides two data collection methods:
- PPO Oracle (
get_mikasa_robo_datasets.py) Uses a pre-trained PPO checkpoint to collect expert trajectories. Suitable for all tasks where an oracle checkpoint is available (see the
Sourcecolumn in the Environments & Tasks table).- Motion Planning (
get_mikasa_robo_datasets_motion_planning.py) Uses a geometric planner instead of a learned policy. Required for tasks that cannot be solved by the PPO oracle (e.g. TraceShape).
Collection commands and the full pipeline are documented in Datasets.
The README.md inside
mikasa_robo_suite/vla/dataset_collectors/ contains additional notes on
parallel collection, checkpointing, and resuming interrupted runs.
PPO Oracle Collector#
- class Args(env_id: str | None = 'ShellGameTouch-VLA-v0', path_to_save_data: str = 'data_mikasa_robo', ckpt_dir: str = '.', num_train_data: int = 250)[source]#
Bases:
object
- collect_batched_data_from_ckpt(env_id: str = 'ShellGameTouch-VLA-v0', checkpoint_path: str | None = None, path_to_save_data: str = 'data_mikasa_robo', num_train_data: int = 250)[source]#
Collect episodes in batches; keep a batch only if all episodes are successful.
Motion-Planning Collector#
- BATCHED_TMP_SUBDIR = '_batched'#
python mikasa_robo_suite/vla/dataset_collectors/get_mikasa_robo_datasets_motion_planning.py –env-id BlinkCountButtonPressEasy-VLA-v0 –path-to-save-data data_mikasa_robo –num-train-data 250 –max-attempts 5000
python mikasa_robo_suite/vla/dataset_collectors/get_mikasa_robo_datasets_motion_planning.py –env-id TraceShapeHard-VLA-v0 –path-to-save-data data_mikasa_robo –num-train-data 250 –max-attempts 5000
python mikasa_robo_suite/vla/dataset_collectors/get_mikasa_robo_datasets_motion_planning.py –env-id TraceShapeSeqHard-VLA-v0 –path-to-save-data data_mikasa_robo –num-train-data 250 –max-attempts 5000
python mikasa_robo_suite/vla/dataset_collectors/get_mikasa_robo_datasets_motion_planning.py –env-id GatherAndRecall9-VLA-v0 –path-to-save-data data_mikasa_robo –num-train-data 250 –max-attempts 5000