Skip to main content
Ctrl+K

MIKASA-Robo-VLA Documentation

  • Installation
  • Quick Start
  • Core Concepts
  • Environments & Tasks
  • Benchmarking
    • Evaluation Protocol
    • Observation and Action Space
    • Datasets
    • Wrappers Cookbook
    • Wrappers API
    • Dataset Collectors API
    • Environments API
    • FAQ and Troubleshooting
  • GitHub
  • Installation
  • Quick Start
  • Core Concepts
  • Environments & Tasks
  • Benchmarking
  • Evaluation Protocol
  • Observation and Action Space
  • Datasets
  • Wrappers Cookbook
  • Wrappers API
  • Dataset Collectors API
  • Environments API
  • FAQ and Troubleshooting
  • GitHub

Section Navigation

  • Shell Game Touch
  • Shell Game Push
  • Shell Game Shuffle Touch
  • Shell Game Color Lamp Touch
  • Shell Game Shuffle Color Lamp Touch
  • Intercept
  • Intercept Grab
  • Rotate Lenient
  • Rotate Strict
  • Take It Back
  • Remember Color
  • Remember Shape
  • Remember Shape And Color
  • Find Imposter Color
  • Find Imposter Shape
  • Find Imposter Shape And Color
  • Bunch Of Colors
  • Seq Of Colors
  • Chain Of Colors
  • Trace Shape
  • Trace Shape Sequence
  • Blink Count Button Press
  • Timed Transfer
  • Batteries Checker Easy
  • Batteries Checker Hard
  • Gather And Recall
  • Environments & Tasks
  • Blink Count Button Press

Blink Count Button Press#

Language Instruction

Count how many times the blue lamp blinks, press the red button exactly that many times when the red lamp turns green, then press the black button to submit your answer.

Task Description

Temporal counting: count blue-lamp blinks, wait for execution, press the red button exactly that many times, then submit.

Render preview for Blink Count Button Press

Source#

  • Module: mikasa_robo_suite.vla.memory_envs.blink_count_button_press_vla

  • Source file: mikasa_robo_suite/vla/memory_envs/blink_count_button_press_vla.py

Difficulty and Parameters#

Easy, Medium, Hard, and Long variants increase count range, delay, and total horizon.

Variants#

Env ID

Horizon

Data Source

BlinkCountButtonPressEasy-Long-VLA-v0

1200

MP

BlinkCountButtonPressEasy-VLA-v0

150

MP

BlinkCountButtonPressHard-Long-VLA-v0

1200

MP

BlinkCountButtonPressHard-VLA-v0

300

MP

BlinkCountButtonPressMedium-Long-VLA-v0

1200

MP

BlinkCountButtonPressMedium-VLA-v0

200

MP

Run Example#

import gymnasium as gym
import torch

import mikasa_robo_suite.vla.memory_envs  # registers VLA env IDs
from mikasa_robo_suite.vla.utils.apply_wrappers import apply_mikasa_vla_wrappers

env = gym.make(
    "BlinkCountButtonPressEasy-VLA-v0",
    num_envs=1,
    obs_mode="rgb",
    control_mode="pd_ee_delta_pose",
    render_mode="all",
)
env = apply_mikasa_vla_wrappers(env)
obs, info = env.reset(seed=42)
for _ in range(env.max_episode_steps):
    action = env.action_space.sample()
    if not torch.is_tensor(action):
        action = torch.as_tensor(action, device=env.unwrapped.device)
    obs, reward, terminated, truncated, info = env.step(action)
env.close()

Dataset Collection#

Motion-planning MIKASA-Robo-90 variants use planner plus replay collection:

uv run python mikasa_robo_suite/vla/dataset_collectors/get_mikasa_robo_datasets_motion_planning.py \
  --env-id BlinkCountButtonPressEasy-VLA-v0 \
  --path-to-save-data data_mikasa_robo \
  --num-train-data 250 \
  --max-attempts 5000 \
  --seed 0

Render Videos#

Generate a fresh MP4/GIF render with:

uv run python utils/prepare_benchmark_demo_videos.py \
  --tasks BlinkCountButtonPressEasy-VLA-v0 \
  --output-dir videos/benchmark_demos \
  --max-attempts-per-task 8 \
  --overwrite

Generated media stays under videos/ and should be published deliberately rather than committed by default.

previous

Trace Shape Sequence

next

Timed Transfer

On this page
  • Source
  • Difficulty and Parameters
  • Variants
  • Run Example
  • Dataset Collection
  • Render Videos
Show Source

© Copyright 2025, Egor Cherepanov, Nikita Kachaev, Alexey K. Kovalev, Aleksandr I. Panov.

Created using Sphinx 8.1.3.

Page views