truelabel

PHYSICAL AI DATASETS

Every physical AI dataset, in one browsable directory.

Search and filter across robotics, egocentric, teleoperation, manipulation, and simulation datasets. Each row links to a dataset page with modality, license, commercial-use, and procurement notes.

658 dataset pages indexed

Commercial use
License
Modality
Task
Robot
Format

Showing 50 of 658 datasets

IPEC-COMMUNITY/bridge_orig_lerobot – 53K Episodes for Robotic Manipulation

Published Jun 2026 · Dataset profile

IPEC-COMMUNITY/bridge_orig_lerobot is a large-scale robotic manipulation dataset containing 53,192 episodes across 19,974 distinct tasks, totaling 1,893,026 frames of WidowX robot video captured at 5 fps.

  • Teleoperation
  • Manipulation
  • Manipulation

IPEC-COMMUNITY/droid_lerobot – 27M Frames, Apache-2.0, Franka Robot Teleoperation

Published Jun 2026 · Dataset profile

IPEC-COMMUNITY/droid_lerobot is a large-scale robotics dataset comprising 92,233 episodes and over 27 million frames of Franka Panda manipulation teleoperation, released under the permissive Apache-2.0 license.

  • Teleoperation
  • Manipulation
  • Manipulation

cadene/droid_1.0.1 – 95K Franka Episodes for Vision-Language-Action Training

Published Jun 2026 · Dataset profile

cadene/droid_1.0.1 is an Apache-2.0 licensed robotics dataset containing 95,600 episodes collected from Franka robotic arms, totaling 27.6 million frames captured at 15fps with multiple camera views per episode.

  • Teleoperation
  • Manipulation
  • Manipulation

NVIDIA GR00T X-Embodiment Sim Dataset – 9k Bimanual Trajectories

Published Jun 2026 · Dataset profile

The NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim dataset provides 9,000 simulated bimanual manipulation trajectories across multiple robot embodiments, released under the permissive CC-BY-4.0 license.

  • Manipulation
  • Simulation
  • Manipulation

IPEC-COMMUNITY/language_table_lerobot – Apache-2.0 Licensed xArm Dataset

Published Jun 2026 · Dataset profile

IPEC-COMMUNITY/language_table_lerobot is an Apache-2.0 licensed robotics dataset containing 442,226 episodes and 7,045,476 frames of xArm manipulation data recorded at 10 FPS.

  • Teleoperation
  • Manipulation
  • Manipulation

dk1_duplo_disassembly_2026-04-04

Published May 2026 · apache-2.0 · andreaskoepf

This dataset was created using LeRobot. Dataset Description Bimanual duplo disassembly demonstrations with DK1 robot Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: separate the duplo bricks Episodes: 42…

  • 357 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

gs_scenes

Published May 2026 · apache-2.0 · RukawaY

A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Ziyuan Xia • Jingyi Xu • Chong Cui • Yuanhong Yu• Jiazhao Zhang • Qingsong Yan • Tao Ni Junbo Chen • Xiaowei Zhou • Hujun Bao • Ruizhen Hu • Sida Peng 🤗 About This…

  • 1,779 downloads
  • 1 likes
  • n<1K
  • Paper available

Open X-Embodiment

Published May 2026 · custom

A large cross-institution collection of robot demonstrations spanning many embodiments and manipulation tasks.

  • Multi-institution robot demonstration corpus; exact per-task scale varies by contributing dataset.
  • Commercial use unclear
  • Best for: robot foundation model pretraining
  • RGB-D
  • Proprioception
  • Robot Grasping

DROID

Published May 2026 · custom

A real-world robot manipulation dataset focused on diverse teleoperated demonstrations outside narrow lab-only settings.

  • Large real-world manipulation corpus; check source for current release counts.
  • Commercial use unclear
  • Best for: real-world manipulation pretraining
  • Teleoperation
  • RGB-D
  • Robot Grasping

BridgeData V2

Published May 2026 · custom

A robot manipulation dataset from Berkeley focused on real-world behavior cloning and task generalization.

  • Robot manipulation demonstrations across multiple tasks; source release describes exact split.
  • Commercial use unclear
  • Best for: behavior cloning
  • RGB-D
  • Proprioception
  • Robot Grasping

RT-1

Published May 2026 · custom

A robotics transformer data release associated with language-conditioned robot manipulation research.

  • Large language-conditioned robot demonstrations described in the source paper and project materials.
  • Commercial use unclear
  • Best for: language-conditioned robotics
  • RGB-D
  • Proprioception
  • Household Manipulation

ALOHA

Published May 2026 · custom

A low-cost bimanual teleoperation platform and dataset family used for imitation learning in dexterous manipulation.

  • Task-specific demonstrations released around the ALOHA platform and follow-on projects.
  • Commercial use unclear
  • Best for: bimanual imitation learning
  • Teleoperation
  • RGB-D
  • Bimanual Manipulation

RoboMimic

Published May 2026 · mit

A benchmark and dataset framework for robot imitation learning with standardized tasks and evaluation utilities.

  • Benchmark datasets and demonstration formats vary by task suite.
  • Source appears permissive; verify data terms
  • Best for: imitation-learning baselines
  • Proprioception
  • RGB-D
  • Robot Grasping

RoboNet

Published May 2026 · custom

A multi-robot dataset for visual foresight and manipulation policy research.

  • Multi-robot manipulation dataset; source materials specify exact robot/task counts.
  • Commercial use unclear
  • Best for: visual dynamics
  • RGB-D
  • Proprioception
  • Robot Grasping

Meta-World

Published May 2026 · mit

A simulated manipulation benchmark for multi-task and meta-reinforcement learning.

  • Benchmark suite of simulated manipulation tasks.
  • Source appears permissive; verify data terms
  • Best for: multi-task RL
  • Proprioception
  • Robot Grasping

RLBench

Published May 2026 · custom

A simulated robot learning benchmark with many manipulation tasks in CoppeliaSim.

  • Large simulated task suite; source materials define current task count.
  • Commercial use unclear
  • Best for: simulated manipulation
  • RGB-D
  • Proprioception
  • Robot Grasping

CALVIN

Published May 2026 · custom

A benchmark for language-conditioned long-horizon robot manipulation in simulated environments.

  • Long-horizon simulated benchmark and demonstrations.
  • Commercial use unclear
  • Best for: language-conditioned policy evaluation
  • RGB-D
  • Proprioception
  • Household Manipulation

ManiSkill

Published May 2026 · apache-2

A simulation benchmark and toolkit for manipulation skills and embodied AI policy evaluation.

  • Simulation suite with tasks and environments maintained by the ManiSkill project.
  • Source appears permissive; verify data terms
  • Best for: simulated manipulation
  • RGB-D
  • Proprioception
  • Robot Grasping

Ego4D

Published May 2026 · custom

A large-scale egocentric video dataset focused on first-person human activity understanding.

  • Large-scale first-person video corpus with annotations; source controls exact access terms.
  • Commercial use restricted
  • Best for: egocentric perception
  • Egocentric video
  • Motion capture
  • Human Object Interaction

EPIC-KITCHENS

Published May 2026 · custom

An egocentric video dataset of kitchen activities used for action recognition and human-object interaction research.

  • Large egocentric kitchen activity dataset with action annotations.
  • Commercial use restricted
  • Best for: egocentric action recognition
  • Egocentric video
  • Human Object Interaction

HOI4D

Published May 2026 · custom

A 4D egocentric human-object interaction dataset with RGB-D and pose-oriented annotations.

  • Human-object interaction dataset with 4D annotations; exact release details live on the source site.
  • Commercial use unclear
  • Best for: hand-object perception
  • Egocentric video
  • RGB-D
  • Human Object Interaction

DexYCB

Published May 2026 · custom

A dexterous hand-object interaction dataset centered on grasping YCB objects with 3D annotations.

  • Dexterous grasping dataset with YCB objects and pose labels.
  • Commercial use unclear
  • Best for: hand pose estimation
  • RGB-D
  • Motion capture
  • Human Object Interaction

ScanNet

Published May 2026 · custom

An indoor RGB-D reconstruction dataset used for 3D scene understanding.

  • Large indoor RGB-D scene reconstruction corpus.
  • Commercial use restricted
  • Best for: 3D scene understanding
  • RGB-D
  • Point cloud
  • Navigation

Habitat datasets

Published May 2026 · custom

A family of embodied AI datasets and simulation assets for navigation and rearrangement research.

  • Multiple scene and task datasets under the AI Habitat ecosystem.
  • Commercial use unclear
  • Best for: embodied navigation
  • RGB-D
  • Point cloud
  • Navigation

AI2-THOR

Published May 2026 · apache-2

An interactive simulated environment for embodied AI agents in household-like scenes.

  • Interactive household simulation scenes and tasks.
  • Source appears permissive; verify data terms
  • Best for: embodied agent research
  • RGB-D
  • Navigation

BEHAVIOR

Published May 2026 · custom

A benchmark for household activities and embodied AI tasks in simulation.

  • Household activity benchmark and simulation assets.
  • Commercial use unclear
  • Best for: household task taxonomies
  • RGB-D
  • Proprioception
  • Household Manipulation

Something-Something V2

Published May 2026 · custom

A human action video dataset focused on object interactions and temporal reasoning.

  • Large human-object action video dataset; access and current terms are controlled by the dataset host.
  • Commercial use restricted
  • Best for: temporal action recognition
  • Third-person video
  • Human Object Interaction

Waymo Open Dataset

Published May 2026 · custom

A large autonomous driving dataset with camera, LiDAR, and labeled traffic scenes.

  • Large autonomous driving scenes with cameras and LiDAR.
  • Commercial use restricted
  • Best for: autonomous driving perception
  • RGB-D
  • Point cloud
  • Navigation

ObjectFolder

Published May 2026 · custom

A dataset family for object-centric physical properties, geometry, and multimodal perception research.

  • Object-centric multimodal assets; source materials define current object count and modalities.
  • Commercial use unclear
  • Best for: object perception
  • RGB-D
  • Tactile
  • Robot Grasping

BC-Z

Published May 2026 · custom

A behavior cloning project focused on zero-shot task generalization for robots.

  • Task demonstrations and model references associated with the BC-Z project.
  • Commercial use unclear
  • Best for: behavior cloning
  • RGB-D
  • Proprioception
  • Robot Grasping

DexMV

Published May 2026 · custom

A dexterous manipulation dataset focused on multi-view visual observations and hand-object interaction.

  • Dexterous manipulation demonstrations and visual observations; verify source for release details.
  • Commercial use unclear
  • Best for: dexterous manipulation
  • RGB-D
  • Motion capture
  • Human Object Interaction

Kinetics

Published May 2026 · custom

A large video action recognition dataset used widely for video model pretraining.

  • Large action recognition video corpus maintained through public benchmark releases.
  • Commercial use restricted
  • Best for: video pretraining
  • Third-person video
  • Human Object Interaction

RoboSuite

Published May 2026 · mit

A simulation framework and benchmark suite for robot manipulation tasks.

  • Simulation tasks and assets for manipulation research.
  • Source appears permissive; verify data terms
  • Best for: robot manipulation simulation
  • RGB-D
  • Proprioception
  • Robot Grasping

RH20T

Published May 2026 · custom

A real-world contact-rich robot manipulation dataset with multimodal sensing, force, audio, and human demonstration video.

  • Source describes more than 110,000 contact-rich manipulation sequences with visual, force, audio, action, and human demonstration signals.
  • Commercial use unclear
  • Best for: contact-rich manipulation
  • Teleoperation
  • RGB-D
  • Robot Grasping

AgiBot World

Published May 2026 · custom

A large-scale real-world robot manipulation dataset family for fine-grained manipulation, tool use, and multi-robot collaboration.

  • Hugging Face organization page describes the Beta release as 1M+ trajectories and 2,976.4 hours across 217 tasks, 87 skills, 3,000+ objects, and 100+ real-world scenarios.
  • Commercial use unclear
  • Best for: large-scale manipulation pretraining
  • Teleoperation
  • RGB-D
  • Household Manipulation

RoboCasa

Published May 2026 · custom

A large-scale kitchen simulation framework and dataset family for everyday manipulation tasks in diverse household environments.

  • RoboCasa365 source materials describe 365 everyday tasks, 2,500 kitchen environments, 600+ hours of human demonstration data, and 1,600+ hours of synthetic demonstrations.
  • Commercial use unclear
  • Best for: large-scale kitchen simulation
  • RGB-D
  • Proprioception
  • Household Manipulation

LIBERO

Published May 2026 · custom

A benchmark suite for lifelong robot learning and language-conditioned manipulation tasks.

  • Benchmark datasets are organized around multiple LIBERO task suites, including spatial, object, goal, and long-horizon manipulation variants.
  • Commercial use unclear
  • Best for: VLA benchmark evaluation
  • RGB-D
  • Proprioception
  • Robot Grasping

RoboSet

Published May 2026 · custom

A real-world multi-task kitchen manipulation dataset with teleoperated and kinesthetic demonstrations.

  • Source describes 30,050 trajectories, including 9,500 collected through teleoperation, across 12 skills and 38 tasks with four camera views.
  • Commercial use unclear
  • Best for: real-world kitchen manipulation
  • Teleoperation
  • RGB-D
  • Household Manipulation

RoboTurk

Published May 2026 · custom

A large-scale teleoperation data collection platform and dataset family for robot manipulation tasks.

  • Project materials describe over 100 hours of real robot data and thousands of successful manipulation demonstrations collected through remote users.
  • Commercial use unclear
  • Best for: teleoperation collection design
  • Teleoperation
  • RGB-D
  • Robot Grasping

UMI

Published May 2026 · custom

Universal Manipulation Interface is an in-the-wild human demonstration framework for transferring portable gripper data to robot policies.

  • Project materials emphasize portable in-the-wild data collection and fast demonstrations for tasks such as cup manipulation, dish washing, cloth folding, and dynamic tossing.
  • Commercial use unclear
  • Best for: portable in-the-wild demonstrations
  • Egocentric video
  • Teleoperation
  • Bimanual Manipulation

FurnitureBench

Published May 2026 · custom

A real-world long-horizon furniture assembly benchmark with successful demonstration data.

  • Documentation describes 219.6 hours and 5,100 successful furniture assembly demonstrations collected with controller and keyboard inputs.
  • Commercial use unclear
  • Best for: long-horizon assembly
  • Teleoperation
  • RGB-D
  • Furniture Assembly

TACO Play

Published May 2026 · custom

A kitchen robot manipulation dataset with Franka arm interaction data available through TensorFlow Datasets.

  • TensorFlow Datasets documentation lists TACO Play as Franka kitchen interaction data with train and test splits and a 47.77 GiB dataset size.
  • Commercial use unclear
  • Best for: TFDS robotics ingestion
  • RGB-D
  • Proprioception
  • Household Manipulation

LeRobot datasets

Published May 2026 · custom

A Hugging Face robotics dataset ecosystem and standardized dataset format for multimodal robot learning data.

  • LeRobot documentation describes a standardized dataset ecosystem on Hugging Face Hub using Parquet for tabular data and MP4 for video observations.
  • Commercial use unclear
  • Best for: robotics dataset distribution
  • Teleoperation
  • RGB-D
  • Robot Grasping

blueberry-kitchen

Published Apr 2026 · apache-2.0 · fecasado

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "blueberry_ros", "total_episodes": 80, "total_frames": 29986, "total_tasks": 1, "chunks_size": 1000,…

  • 539 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

LARYBench

Published Apr 2026 · cc-by-nc-sa-4.0 · meituan-longcat

LARY — A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment LARY is a unified evaluation framework for latent action representations.

  • 8,102 downloads
  • 16 likes
  • 1M<n<10M
  • Paper available

sfp_dataset

Published Apr 2026 · apache-2.0 · pvrohin

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ur5e_aic", "total_episodes": 50, "total_frames": 91714, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 488 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

CostNav-Teleop-Dataset

Published Apr 2026 · apache-2.0 · maum-ai

CostNav Teleop Dataset Dataset Summary The CostNav Teleop Dataset is a large-scale collection of human teleoperation recordings for robot navigation in an urban sidewalk simulation environment.

  • 5,037 downloads
  • 1K<n<10K
  • Paper available
  • Tabular
  • Text
  • Csv

D2E-480p

Published Apr 2026 · cc-by-nc-4.0 · open-world-agents

D2E-480p Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.

  • 2,236 downloads
  • 1 likes
  • n<1K
  • Paper available
  • Video

D2E-Original

Published Apr 2026 · cc-by-nc-4.0 · open-world-agents

D2E-Original Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.

  • 2,230 downloads
  • 2 likes
  • n<1K
  • Paper available
  • Video

agibotworld-alpha

Published Apr 2026 · cc-by-sa-4.0 · yxgao

from datasets import Features, Value, Video, load_dataset features = Features({ "head_color.mp4": Video(), "proprio_stats.hdf5": Value("binary"), "task.json": { "episode_id": Value("int64"), "init_scene_text": Value("string"),…

  • 2,423 downloads
  • n>1T

briobin

Published Apr 2026 · apache-2.0 · buggybrain

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 51, "total_frames": 22577, "total_tasks": 1, "chunks_size": 1000,…

  • 364 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

DOM

Published Apr 2026 · other · hzxie

Dynamic Object Manipulation (DOM) Project Page | Paper | Code TL;DR: DOM is a large-scale dynamic manipulation dataset with 200K episodes, 2,800+ scenes, and 206 objects for training and evaluating VLA models.

  • 6,931 downloads
  • 6 likes
  • 100K<n<1M
  • Paper available

AgiBotWorld2026

Published Apr 2026 · cc-by-nc-sa-4.0 · agibot-world

AgiBot World 2026 Real-World Embodied Intelligence Dataset Overview As robotics research advances into real-world scenarios, the demand for authentic, high-quality data has become increasingly urgent.

  • 47,447 downloads
  • 28 likes
  • 1K<n<10K
  • Image
  • Text

dynamics-probing

Published Apr 2026 · mit · leesangoh

PhysProbe Dynamics Probing Dataset Manipulation episodes from Isaac Lab collected for probing physics understanding in video world models (V-JEPA 2, VideoMAE, DINOv2).

  • 3,873 downloads
  • 1K<n<10K
  • Video

GNN_Disassembly_WorldModel

Published Apr 2026 · cc-by-4.0 · ChangChrisLiu

GNN Constraint-Aware World Model Dataset (v3) Real robot episodes with per-frame constraint graphs, SAM2 segmentation masks + 256-D feature embeddings, full 3D depth bundles, and synchronized robot states across two manipulation domains.

  • 10,423 downloads
  • 2 likes
  • 1K<n<10K

musclemimic-retargeted

Published Apr 2026 · other · amathislab

MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoFullBody musculoskeletal model, generated using General Motion Retargeting (GMR).

  • 1,174 downloads
  • 5 likes
  • not specified
  • Paper available
  • Gated access

10Kh-RealOmin-OpenData

Published Apr 2026 · cc-by-sa-4.0 · genrobot2025

Boasting over 13,000 hours of cumulative data and 5 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. Update Notes:Stage 3 data upload completed.

  • 200,660 downloads
  • 211 likes
  • n>1T
  • Gated access
  • Video

Android-Control

Published Apr 2026 · apache-2.0 · leosltl

Android Control (Community Mirror) This is a community mirror of the official Android Control dataset by Google Research, hosted on Hugging Face for easier access.

  • 355 downloads
  • 10K<n<100K
  • Paper available

vlabench_primitive_ft_lerobot_video

Published Apr 2026 · mit · VLABench

VLABench Primitive Tasks Dataset - LeRobot v3.0 Dataset Description This dataset is organized in the LeRobot v3.0 format and is used for integrating VLABench into the LeRobot framework officially.

  • 2,655 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

ViFailback-Dataset

Published Apr 2026 · mit · sii-rhos-ai

ViFailback Dataset: Real-World Robotic Manipulation Failure Dataset with Visual Symbol Guidance A real-world dataset for diagnosing, correcting, and learning from robotic manipulation failures via visual symbols.

  • 25,323 downloads
  • 4 likes
  • n<1K
  • Paper available
  • Image
  • Imagefolder

Master_wire_hook_20260402

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 638, "total_frames": 421684, "total_tasks": 1, "chunks_size": 1000,…

  • 2,046 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

ARCH_master_wire_hook_20260402

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 703, "total_frames": 456261, "total_tasks": 1, "chunks_size": 1000,…

  • 1,998 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

droid_xxjd_7_canonical

Published Apr 2026 · apache-2.0 · michios

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 80, "total_frames": 33909, "total_tasks": 26, "chunks_size": 1000, "data_files_size_in_mb":…

  • 320 downloads
  • 10K<n<100K
  • Tabular
  • Text
  • Parquet

reachy-pick-and-place-images

Published Apr 2026 · apache-2.0 · erl-hub

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "reachy2", "total_episodes": 25, "total_frames": 6652, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 597 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

libero_cot

Published Apr 2026 · apache-2.0 · yinchenghust

This dataset was created using LeRobot. It contains embodied Chain-of-Thought (CoT) demonstrations for the LIBERO benchmark, featuring paired reasoning and action traces.

  • 1,016 downloads
  • 100K<n<1M
  • Paper available
  • Image
  • Timeseries
  • Parquet

PhysicalAI-Robotics-NuRec

Published Apr 2026 · cc-by-4.0 · nvidia

Dataset Description The Physical AI NuRec dataset seeks to empower robotic researchers to build the next generation of physical AI based end-to-end robotic models.

  • 751 downloads
  • 57 likes
  • not specified
  • Gated access
  • 3d
  • Image

seed

Published Apr 2026 · other · bones-studio

BONES-SEED: Skeletal Everyday Embodiment Dataset BONES-SEED (Skeletal Everyday Embodiment Dataset) is an open dataset of 142,220 annotated human motion animations for humanoid robotics.

  • 3,468 downloads
  • 115 likes
  • 100K<n<1M
  • Gated access

example_data_fastumi_pro_raw

Published Apr 2026 · other · LumosRobotics-FastUMIPro

FastUMI Pro – Multimodal Sample Dataset Small-Scale Demonstration Data from the FastUMI Pro Multimodal Sensing System (Only Hundreds of Trajectories — Full Dataset Available Upon Request) Project Homepage 📦 FastUMI Data Market 🔥FastUMI…

  • 5,360 downloads
  • 4 likes
  • not specified

vlabench_unified

Published Apr 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 10977, "total_frames": 3114872, "total_tasks": 295, "chunks_size": 1000,…

  • 326 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

xperience-10m

Published Apr 2026 · other · ropedia-ai

⚠️ Important: If you have already submitted an access request but have not completed the required DocuSign agreement, your request will remain pending. Please complete signing and we will grant access once verified.

  • 188,007 downloads
  • 181 likes
  • 1M<n<10M
  • Gated access
  • 3d
  • Audio

UniHand_Preview

Published Apr 2026 · not specified · BeingBeyond

This data is a subset of the pretraining data for Being-H0.5. Citation Being-H0.5 @article{beingbeyond2026beingh05, title={Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization}, author={Luo, Hao and Wang, Ye…

  • 621 downloads
  • 5 likes
  • 10K<n<100K
  • Paper available
  • Text
  • Webdataset

PhysicalAI-Robotics-Open-H-Embodiment

Published Apr 2026 · cc-by-4.0 · nvidia

Dataset Description: Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate AI autonomy models for surgical robotics and ultrasound.

  • 37,477 downloads
  • 24 likes
  • n<1K
  • Image
  • Video

SuSuInterActs

Published Apr 2026 · cc-by-4.0 · Chuhaojin

SuSuInterActs Dataset A Large-Scale Multimodal Dialogue Corpus with Synchronized Speech, Full-Body Motion, and Facial Expressions From the paper: SentiAvatar: Towards Expressive and Interactive Digital Humans Overview SuSuInterActs is a…

  • 1,791 downloads
  • 8 likes
  • 10K<n<100K
  • Paper available

r3d

Published Apr 2026 · not specified · eddie-cui

R3D: Revisiting 3D Policy Learning Project Page | Paper | GitHub This repository contains the pre-processed data for R3D, a framework for robust and scalable 3D imitation learning.

  • 5,403 downloads
  • 1 likes
  • not specified
  • Paper available

OmniWorld

Published Apr 2026 · cc-by-nc-sa-4.0 · InternRobotics

[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling 🎉NEWS [2026.3.21] 🔥 OmniWorld-Game with Metric Scale is now released!

  • 200,677 downloads
  • 90 likes
  • 1B<n<10B
  • Paper available
  • Image
  • Text
  • Webdataset

TAMEn

Published Apr 2026 · cc-by-nc-sa-4.0 · OpenDriveLab-org

TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks 🎯 Overview TAMEn builds upon the UMI paradigm with key enhancements in multimodality, precision-portability synergy, replayability, and data…

  • 1,138 downloads
  • not specified
  • Paper available

Exylos_PickNPlace_Sample

Published Apr 2026 · cc-by-nc-4.0 · VadExylos

Exylos Pick & Place Sample Dataset Summary Exylos Pick & Place Sample is a compact multi-view robotics dataset in LeRobot-style structure for a single manipulation task: pick up an object and place it into a container.

  • 368 downloads
  • 1 likes
  • n<1K

robocasa_target_human_unified

Published Apr 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "robocasa", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 50, "chunks_size": 1000,…

  • 985 downloads
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

pick_cube_60mm

Published Apr 2026 · apache-2.0 · mfisch3r

Pick & Place Cube Dataset Collected for Agibot G2 robot

  • 980 downloads
  • 10K<n<100K
  • Tabular
  • Text
  • Parquet

metaworld_mt50

Published Apr 2026 · apache-2.0 · KentLuo

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…

  • 446 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

musclemimic-bimanual-retargeted

Published Apr 2026 · other · amathislab

MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoBimanualArm musculoskeletal model, generated using General Motion Retargeting (GMR).

  • 587 downloads
  • not specified
  • Paper available
  • Gated access

cache

Published Apr 2026 · bsd-3-clause · haichaozhang

ThinkJEPA Proprocessed Data Cache Dataset Description This repository contains the released preprocessed cache used by ThinkJEPA: ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model.

  • 9,744 downloads
  • 1 likes
  • 1K<n<10K
  • Paper available
  • Gated access
  • Text
  • Video
  • Text

G1_WBT_Inspire_Put_Vegetables_Into_Basket

Published Apr 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 641 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Drinks_Into_Fridge

Published Apr 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 481 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Dex1_Put_Clothes_into_Washing_Machine

Published Apr 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 394 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

test

Published Apr 2026 · apache-2.0 · DAVIAN-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 356, "total_frames": 103656, "total_tasks": 16, "chunks_size": 1000, "data_files_size_in_mb":…

  • 809 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

simtos_one_item_can_pet_trim_0.2

Published Apr 2026 · apache-2.0 · rainbowrobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_size": 1000, "data_files_size_in_mb":…

  • 1,844 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

thesis-reconstructed-trajectories

Published Apr 2026 · mit · Sswa12

Reconstructed GNM Trajectories GNM-format trajectories reconstructed from RECON, SCAND, GoStanford2, and other navigation datasets. Used as training data for the CAST safety pipeline and Neural CBF.

  • 6,868 downloads
  • not specified
  • Image

record-test-lab

Published Apr 2026 · apache-2.0 · alex-cta

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "cta_ur_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0,…

  • 523 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

lead

Published Apr 2026 · mit · ln2697

LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving Project Page | Paper | Code Official CARLA dataset accompanies our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving.

  • 4,952 downloads
  • 2 likes
  • 1M<n<10M
  • Paper available

LeWAM_community_dataset_preprocessed

Published Apr 2026 · apache-2.0 · ehalicki

LeWAM Community Dataset (Preprocessed) A large-scale community-contributed robotics dataset for robot learning research featuring 318 datasets from the LeRobot community.

  • 1,652 downloads
  • n<1K
  • Video

leisaac-pick-orange-prepared

Published Apr 2026 · apache-2.0 · himanshu9series

LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original?

  • 375 downloads
  • n<1K
  • Video

droid-1.0.1-preprocessed

Published Apr 2026 · apache-2.0 · SeonghuJeon

DROID 1.0.1 — Preprocessed Mirror A ready-to-use mirror of lerobot/droid_1.0.1 with two preprocessing fixes baked in: Fixed meta/episodes/*.parquet timestamps.

  • 899 downloads
  • 10K<n<100K
  • Video

pick_up_single_colored_cube_into_tape_v7

Published Apr 2026 · apache-2.0 · wiscohumanoids

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 226, "total_frames": 120729, "total_tasks": 3, "chunks_size": 1000,…

  • 320 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

SNOW.SB

Published Apr 2026 · apache-2.0 · williamdgomez

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 91, "total_frames": 73578, "total_tasks": 1, "chunks_size": 1000,…

  • 2,452 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

computer-use-large

Published Apr 2026 · cc-by-4.0 · Sohai51515151

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 4,534 downloads
  • 10K<n<100K

omx_f_omx_blue_block_yellow_basket_random

Published Apr 2026 · apache-2.0 · dream-79

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "omx_f", "total_episodes": 99, "total_frames": 14858, "total_tasks": 1, "total_videos": 198, "total_chunks": 1,…

  • 368 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

RoboMIND

Published Apr 2026 · apache-2.0 · x-humanoid-robomind

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation Accepted by Robotics: Science and Systems (RSS) 2025.

  • 18,187 downloads
  • 39 likes
  • n>1T
  • Paper available
  • Gated access

data_grapes

Published Apr 2026 · apache-2.0 · ry-5

FR5 Garlic Manipulation Dataset This dataset contains demonstrations collected on an FR5 robot teleoperated to manipulate garlic.

  • 349 downloads
  • 100K<n<1M
  • Tabular
  • Video
  • Parquet

isaaclab-ur5-insertion-stacked-cuboid-randomized

Published Apr 2026 · apache-2.0 · DORLR

This dataset was created with the IsaacLab-WireManagement scripted agent. Total time: 45 minutes 56.85 seconds to generate 506 episodes.

  • 972 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

peract_lerobot_train

Published Apr 2026 · apache-2.0 · alesspalma

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1706, "total_frames": 71623, "total_tasks": 201, "total_videos": 0, "total_chunks": 2,…

  • 1,179 downloads
  • 1 likes
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

gr1_pickup_compact_h264

Published Apr 2026 · apache-2.0 · vedpatwardhan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 150, "total_frames": 7800, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb": 100,…

  • 948 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

psi0-apple-to-plate-teleop

Published Apr 2026 · apache-2.0 · cloudwalk-research

Psi0 Apple-to-Plate VR Teleoperation Dataset Human-demonstrated loco-manipulation trajectories for fine-tuning the Psi0 Vision-Language-Action (VLA) model on a Unitree G1 humanoid robot in Isaac Lab simulation.

  • 451 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Video
  • Parquet

0413

Published Apr 2026 · apache-2.0 · mkkimuser

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 79017, "total_tasks":1, "total_videos": 1500,…

  • 1,445 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

soarm101_pickplace_multicolor_v1

Published Apr 2026 · apache-2.0 · edgarcancinoe

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 30, "total_frames": 27724, "total_tasks": 2, "chunks_size": 1000,…

  • 345 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

libero_jpeg

Published Apr 2026 · apache-2.0 · HecklesL

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": {…

  • 399 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

metaworld_mt50_jpeg

Published Apr 2026 · apache-2.0 · HecklesL

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…

  • 523 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

teleop-piper-0411

Published Apr 2026 · apache-2.0 · Pantheon-inc

teleop-piper-0411 Teleoperation dataset for the Agilex Piper robotic arm, collected April 11, 2026 across 3 rigs. Dual-camera (ego wrist + exo third-person), LeRobot v2 format.

  • 450 downloads
  • n<1K
  • Video

computer-use-large

Published Apr 2026 · cc-by-4.0 · helpechtox98988

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 14,562 downloads
  • 10K<n<100K

franka-fr3-manipulation

Published Apr 2026 · apache-2.0 · binsabit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 101, "total_frames": 21006, "total_tasks": 4, "chunks_size": 1000, "fps": 8, "splits": {…

  • 373 downloads
  • 10K<n<100K
  • Image
  • Tabular
  • Parquet

build_block_tower

Published Apr 2026 · apache-2.0 · villekuosmanen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "arx5", "total_episodes": 60, "total_frames": 41832, "total_tasks": 1, "total_videos": 120, "total_chunks": 1,…

  • 322 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

Published Apr 2026 · cc-by-4.0 · fuzirui

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.

  • 21,084 downloads
  • 1 likes
  • n>1T
  • Paper available

bi_red_tomato

Published Apr 2026 · apache-2.0 · taetae77

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 304, "total_frames": 302309, "total_tasks": 1, "chunks_size": 1000,…

  • 471 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

vla-poursing-seeds-v1

Published Apr 2026 · apache-2.0 · trietlm0306

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 135, "total_frames": 65250, "total_tasks": 5, "chunks_size": 1000,…

  • 357 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_trajectory_overlay

Published Apr 2026 · apache-2.0 · brandonyang

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 4444, "total_frames": 930500, "total_tasks": 1424, "total_videos": 0, "total_chunks": 5,…

  • 1,179 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

robomme

Published Apr 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1600, "total_frames": 768897, "total_tasks": 116, "chunks_size": 1000,…

  • 2,887 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Text
  • Parquet

fold_combined_gt

Published Apr 2026 · apache-2.0 · ywxia

This dataset was created using LeRobot. Dataset Description Data Distribution Overview This figure summarizes the data distribution of the ywxia/fold_combined_gt dataset, auto-generated after each conversion via…

  • 370 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

BAPData

Published Apr 2026 · cc-by-sa-4.0 · Stardust-hyx

BAPData (EFM-10 Benchmark) Project Website | Paper BAPData is a dataset introduced in the paper "Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy".

  • 1,275 downloads
  • 1 likes
  • not specified
  • Paper available
  • Image

Sim1_Dataset

Published Apr 2026 · apache-2.0 · InternRobotics

Sim1_Dataset LeRobot-format manipulation dataset for cloth folding / deformable object tasks. Format This repository follows a LeRobot-style layout per subset: data/chunk-xxx/episode_XXXXXX.parquet: frame-level trajectories…

  • 8,119 downloads
  • 5 likes
  • 1M<n<10M
  • Paper available
  • Gated access
  • Tabular
  • Timeseries

eval_real_so101_record_v1

Published Apr 2026 · apache-2.0 · yangxinye

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 751, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 354 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

real_so101_record_v1

Published Apr 2026 · apache-2.0 · yangxinye

real_so101_record_v1 A real-world LeRobot-format dataset collected on an SO101/SO100-style follower arm with three camera views.

  • 324 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PDD

Published Apr 2026 · cc-by-nc-4.0 · tasl-lab

PDD: Personalized Driving Dataset Dataset Description PDD (Personalized Driving Dataset) is a multi-driver, multi-scenario driving dataset collected in CARLA 0.9.15.

  • 5,746 downloads
  • 10K<n<100K
  • Paper available
  • Image

droid_1.0.1

Published Apr 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…

  • 15,868 downloads
  • 14 likes
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

ManiTwin-100K

Published Apr 2026 · cc-by-nc-4.0 · ManiTwin

ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research.

  • 44,824 downloads
  • 12 likes
  • 100K<n<1M
  • Paper available

dk1_duplo_in_box_two_2026-04-09

Published Apr 2026 · apache-2.0 · andreaskoepf

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 101 Total…

  • 406 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

dk1_duplo_in_box_one_2026-04-09

Published Apr 2026 · apache-2.0 · andreaskoepf

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 101 Total…

  • 623 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

rl_pick_up_block_pi_rlt

Published Apr 2026 · apache-2.0 · villekuosmanen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5", "total_episodes": 164, "total_frames": 34029, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 366 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

GR1-Tabletop-NextState-1000x24

Published Apr 2026 · apache-2.0 · Joocjun

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.

  • 5,976 downloads
  • 1M<n<10M
  • Tabular
  • Video
  • Parquet

Master_wire_hook_screwing_20260408

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 235, "total_frames": 224240, "total_tasks": 1, "chunks_size": 1000,…

  • 803 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

GR1-Tabletop-NextState-300x24

Published Apr 2026 · apache-2.0 · Joocjun

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.

  • 1,385 downloads
  • 1M<n<10M
  • Tabular
  • Video
  • Parquet

GR1-Tabletop-NextState-100x24

Published Apr 2026 · apache-2.0 · Joocjun

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.

  • 1,123 downloads
  • 100K<n<1M
  • Tabular
  • Video
  • Parquet

piper_piper_pickdrop_bottle_70

Published Apr 2026 · apache-2.0 · DRMNmadhan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 70, "total_frames": 42041, "total_tasks": 1, "total_videos": 210, "total_chunks": 1,…

  • 319 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Optimized Parquet

libero_spatial_cot

Published Apr 2026 · apache-2.0 · jayshim

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "total_videos": 0, "total_chunks": 2,…

  • 442 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

embodied_features_and_demos_libero

Published Apr 2026 · mit · Embodied-CoT

Dataset for Embodied Chain-of-Thought Reasoning for LIBERO-90, as used by ECoT-Lite. TFDS Demonstration Data The TFDS dataset contains successful demonstration trajectories for LIBERO-90 (50 trajectories for each of 90 tasks).

  • 758 downloads
  • 4 likes
  • not specified
  • Paper available

lerobot_cli_mobile_block_transfer_v2

Published Apr 2026 · apache-2.0 · TrossenRoboticsCommunity

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_mobile", "total_episodes": 48, "total_frames": 28479, "total_tasks": 1,…

  • 322 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

dk1_duplo_in_box_2026-04-08

Published Apr 2026 · apache-2.0 · andreaskoepf

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 106 Total…

  • 509 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

dk1-merge-2026-03

Published Apr 2026 · apache-2.0 · andreaskoepf

DK-1 Merged Dataset This dataset was created using LeRobot. Dataset Description Merged and deduplicated DK-1 bimanual robot dataset. All source videos are re-encoded to 640x360 h264 with 14D joint-space actions.

  • 874 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

ww_dataset_abs_joint_abs_jointnapkin_only

Published Apr 2026 · apache-2.0 · pmoller

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ur5", "total_episodes": 100, "total_frames": 12696, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 342 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

so101_gba_direct_manipulation

Published Apr 2026 · apache-2.0 · giacomoran

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_leader", "total_episodes": 200, "total_frames": 47567, "total_tasks": 1, "chunks_size": 1000,…

  • 562 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

lerobot_derek_depth

Published Apr 2026 · apache-2.0 · DerekLX

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "chunks_size": 1000, "data_files_size_in_mb":…

  • 3,160 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

ffw_sh5_dataset

Published Apr 2026 · apache-2.0 · learner1119

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ffw_sh5", "total_episodes": 2, "total_frames": 184, "total_tasks": 1, "total_videos": 2, "total_chunks": 1,…

  • 403 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

pick_block

Published Apr 2026 · apache-2.0 · d3d3shan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 40, "total_frames": 22683, "total_tasks": 1, "chunks_size": 1000,…

  • 463 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries

V2X-Radar

Published Apr 2026 · cc-by-nc-nd-4.0 · yanglei18

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception Lei Yang · Xinyu Zhang · Jun Li · Chen Wang · Jiaqi Ma · Zhiying Song · Tong Zhao · Ziying Song · Li Wang · Mo Zhou · Yang Shen · Kai Wu · Chen Lv This is the…

  • 345 downloads
  • 6 likes
  • 10K<n<100K
  • Paper available

aera_semi_pnp_dr_02_04_2026_skip3_delta_no_go_home

Published Apr 2026 · apache-2.0 · Purple69

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 337627, "total_tasks": 72, "chunks_size": 1000,…

  • 509 downloads
  • 100K<n<1M
  • Image
  • Text
  • Parquet

leisaac-pick-orange-prepared

Published Apr 2026 · apache-2.0 · rajeshramana

LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original?

  • 386 downloads
  • n<1K
  • Video

simtos_one_item_can_pet_trim_0.2

Published Apr 2026 · apache-2.0 · kaiseong

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_size": 1000, "data_files_size_in_mb":…

  • 1,202 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

shawn-pickup2

Published Apr 2026 · apache-2.0 · allday-technology

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_stationary", "total_episodes": 1, "total_frames": 293, "total_tasks": 1,…

  • 320 downloads
  • 1 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

Task_0002_OrderPicking_lerobot

Published Apr 2026 · apache-2.0 · ROBOTIS

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "total_episodes": 857, "total_frames": 85474, "total_videos": 2828, "codebase_version": "v2.1", "robot_type": "ffw_bg2_rev4", "total_tasks":…

  • 2,532 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

libero90_successes

Published Apr 2026 · apache-2.0 · arif101

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 272, "total_frames": 11334, "total_tasks": 32, "total_videos": 0, "total_chunks": 1,…

  • 324 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

RoboCerebra_lerobot_v2.1

Published Apr 2026 · apache-2.0 · CollisionCode

RoboCerebra (LeRobot v2.1 Format) Dataset Description This dataset is a converted version of RoboCerebra, now formatted for compatibility with the LeRobot ecosystem (v2.1).

  • 9,034 downloads
  • 1 likes
  • 10K<n<100K
  • Video

stationaray_sdk_edit_test_a

Published Apr 2026 · apache-2.0 · TrossenRoboticsCommunity

This dataset was created using LeRobot. Dataset Description Converted from TrossenMCAP format using the Trossen SDK trossen_mcap_to_lerobot_v2 tool.

  • 2,432 downloads
  • 1K<n<10K
  • Image
  • Tabular
  • Parquet

LIBERO-Para

Published Apr 2026 · mit · HAI-Lab

LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models LIBERO-Para is a controlled benchmark for evaluating the paraphrase robustness of Vision-Language-Action (VLA) models.

  • 1,870 downloads
  • 8 likes
  • 1K<n<10K
  • Paper available

metaworld_mt50

Published Apr 2026 · apache-2.0 · Jiaxin1234

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…

  • 483 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

aera_semi_pnp_dr_02_04_2026

Published Apr 2026 · apache-2.0 · Purple69

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 1345076, "total_tasks": 72, "chunks_size": 1000,…

  • 1,281 downloads
  • 1M<n<10M
  • Image
  • Text
  • Parquet

OmniAction

Published Apr 2026 · cc-by-nc-4.0 · hosam12kalad

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…

  • 33,980 downloads
  • not specified
  • Paper available

community_dataset_v1

Published Apr 2026 · apache-2.0 · azaracla

Community Dataset v1 (v3.0) A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 119 datasets from 52 contributors worldwide.

  • 731 downloads
  • 100K<n<1M
  • Paper available

eval_bi_so101_round1_4cam

Published Apr 2026 · apache-2.0 · Zekai-Chen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 601, "total_frames": 637163, "total_tasks": 1, "chunks_size": 1000,…

  • 479 downloads
  • 100K<n<1M
  • Tabular
  • Text
  • Parquet

libero_spatial_no_noops_abs

Published Apr 2026 · apache-2.0 · locht131

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 415, "total_frames": 50744, "total_tasks": 10, "total_videos": 830, "total_chunks": 1,…

  • 1,081 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

bi-so101-insert-screw-271ep

Published Apr 2026 · apache-2.0 · Elvinky

bi-so101-insert-screw-271ep Bimanual teleoperation dataset for the task "Insert the copper screw into the black sleeve", collected with SO-101 dual arms and LeRobot 0.5.0.

  • 325 downloads
  • not specified

dronedata_new_lerobot

Published Apr 2026 · apache-2.0 · Celina717

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "uav", "total_episodes": 799, "total_frames": 79900, "total_tasks": 2, "total_videos": 0, "total_chunks": 1,…

  • 755 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

robocasa_target

Published Apr 2026 · apache-2.0 · BrunoM42

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 1221, "chunks_size": 1000,…

  • 671 downloads
  • 10M<n<100M
  • Tabular
  • Video
  • Parquet

piper_pickdrop_fooditems_101

Published Apr 2026 · apache-2.0 · DRMNmadhan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120066, "total_tasks": 1, "total_videos": 300, "total_chunks": 1,…

  • 831 downloads
  • 1 likes
  • n<1K
  • Video

agent-reward-bench

Published Apr 2026 · not specified · yuanyyaa

AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Meade, Arkil Patel, Dongchan Shin,…

  • 7,568 downloads
  • 1K<n<10K
  • Paper available
  • Image
  • Text
  • Csv

Master_wire_bend_varying_3wire_20260401

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 290, "total_frames": 814263, "total_tasks": 7, "chunks_size": 1000,…

  • 811 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

droid-cube-stacking-order

Published Apr 2026 · apache-2.0 · jdoo2

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Panda", "total_episodes": 108, "total_frames": 44097, "total_tasks": 1, "total_videos": 324, "total_chunks": 1,…

  • 835 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_Toast_Dataset

Published Apr 2026 · apache-2.0 · n1robotics

This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo.

  • 329 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_StackCups_Dataset

Published Apr 2026 · apache-2.0 · n1robotics

This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo.

  • 325 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_sim_demos

Published Apr 2026 · apache-2.0 · shivakanthsujit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "aloha", "total_episodes": 10, "total_frames": 2568, "total_tasks": 1, "total_videos": 30, "total_chunks": 1,…

  • 432 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

libero_goal_no_noops_abs

Published Apr 2026 · apache-2.0 · locht131

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 413, "total_frames": 49801, "total_tasks": 10, "total_videos": 826, "total_chunks": 1,…

  • 586 downloads
  • n<1K
  • Video

RoboCerebra_hdf5

Published Apr 2026 · apache-2.0 · CollisionCode

📦 RoboCerebra HDF5 Dataset This repository contains the structured HDF5 dataset for the RoboCerebra project, optimized for high-performance robot learning training. ✅ Status: Data Upload Complete.

  • 4,938 downloads
  • not specified
  • Video

Master_wire_bend_varying_3wire_20260327

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 221, "total_frames": 638395, "total_tasks": 7, "chunks_size": 1000,…

  • 445 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

libero_merged

Published Apr 2026 · apache-2.0 · HollyTan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "data_files_size_in_mb":…

  • 751 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Optimized Parquet

so101-puzzle-v9

Published Apr 2026 · apache-2.0 · aivanni

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 53, "total_frames": 47059, "total_tasks": 1, "chunks_size": 1000,…

  • 611 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

0402data

Published Apr 2026 · apache-2.0 · mkkimuser

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 82092, "total_tasks": 1, "total_videos": 1500,…

  • 2,955 downloads
  • n<1K
  • Tabular
  • Timeseries
  • Parquet

Bright2026_19382

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19382 (别碰我爪子) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 1,773 downloads
  • n<1K
  • Video

Bright2026_19380

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19380 (MAEN) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 1,628 downloads
  • not specified

Bright2026_19361

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19361 (DynamicX) Competition task: 环套在柱子上 (Put Ring onto Rod) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 1,568 downloads
  • n<1K
  • Video

Bright2026_19359

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19359 (炬亮启航) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 683 downloads
  • not specified

Bright2026_19353

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19353 (模型学的全队) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 658 downloads
  • n<1K
  • Video

Bright2026_19234

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19234 (Apex) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 1,451 downloads
  • not specified

Bright2026_19229

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19229 (MVP) Competition task: 抓水果 (Fruit Grasping) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 2,515 downloads
  • not specified

so101_pick_place

Published Apr 2026 · mit · QianGroup

Pick object and place in box Project KIWI | Qian Group HRI Lab | University of Houston Detail Value Task Pick object and place in box Episodes 50 FPS 30 Cameras Gripper (Arducam) + Overhead (Intel RealSense) Robot SO-101 (Feetech STS3215)…

  • 4,286 downloads
  • 10K<n<100K
  • Image
  • Imagefolder

Bright2026_19159

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19159 (Robot大王) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 500 downloads
  • not specified

Agilex_Cobot_Magic_storage_towel

Published Apr 2026 · apache-2.0 · RoboCOIN

Agilex_Cobot_Magic_storage_towel Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.

  • 5,830 downloads
  • not specified
  • Paper available

Bright2026_19153

Published Apr 2026 · apache-2.0 · ByteRainx

Bright 2026 Benchmark - Robot 19153 (同济子豪兄 (张子豪团队)) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).

  • 351 downloads
  • n<1K
  • Video

Agilex_Cobot_Magic_place_towel_flat

Published Apr 2026 · apache-2.0 · RoboCOIN

Agilex_Cobot_Magic_place_towel_flat Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.

  • 7,053 downloads
  • not specified
  • Paper available

assemble_box_with_phone_stand_test

Published Apr 2026 · apache-2.0 · Xense

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_flexiv_rizon4_rt", "total_episodes": 35, "total_frames": 315631, "total_tasks": 1, "chunks_size": 1000,…

  • 431 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Master_wire_bend_varying_3wire_20260319

Published Apr 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "chunks_size": 1000,…

  • 563 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Agilex_Cobot_Magic_heat_burger

Published Apr 2026 · apache-2.0 · RoboCOIN

Agilex_Cobot_Magic_heat_burger Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.

  • 6,311 downloads
  • not specified
  • Paper available

pick_cube_put_in_the_box_img3

Published Apr 2026 · apache-2.0 · ClOBOT

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 100, "total_frames": 147066, "total_tasks": 1, "total_videos": 300,…

  • 789 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

pick_cube_put_in_the_box

Published Apr 2026 · apache-2.0 · ClOBOT

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0,…

  • 443 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

piper_pickdrop_fooditems_100

Published Apr 2026 · apache-2.0 · DRMNmadhan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120081, "total_tasks": 1, "total_videos": 300, "total_chunks": 1,…

  • 472 downloads
  • not specified
  • Video

InternData-A1-LeRobot-v3.0-by-embodiment

Published Apr 2026 · cc-by-nc-sa-4.0 · yixuan-tan

InternData-A1 dataset taken from InternRobotics/InternData-A1, with the tarballs extracted and directory structure "transposed" so that the top-level subdirectories are the four embodiments.

  • 25,327 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries

wm-eval-samples-ewmbench

Published Apr 2026 · cc-by-nc-sa-4.0 · Physis-AI

WM-Eval Samples: EWMBench Generated Videos Pre-generated video samples for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework.

  • 5,057 downloads
  • 1K<n<10K
  • Paper available
  • Image
  • Imagefolder

wm-eval-gt-ewmbench

Published Apr 2026 · cc-by-nc-sa-4.0 · Physis-AI

WM-Eval GT: EWMBench Ground Truth Ground truth data for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Source Extracted from agibot-world/EWMBench (gt_dataset.tar).

  • 3,268 downloads
  • 1K<n<10K
  • Paper available
  • Image
  • Imagefolder

Bridge-CoT

Published Apr 2026 · cc-by-4.0 · CliffKai

Bridge-CoT Robot manipulation dataset with chain-of-thought annotations, derived from BridgeDataV2. Each sample pairs a scene image with a task description and structured VLM-generated annotations including object detection, spatial…

  • 7,252 downloads
  • not specified

record-test-realsense-depth-14

Published Apr 2026 · apache-2.0 · av120

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 499, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 927 downloads
  • n<1K
  • Image
  • Tabular
  • Parquet

robocasa_depth_test

Published Apr 2026 · apache-2.0 · kimz1121

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 2, "total_frames": 309, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 1,644 downloads
  • n<1K
  • Tabular
  • Timeseries
  • Parquet

libero

Published Apr 2026 · cc-by-4.0 · sunli1201

This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10.

  • 1,440 downloads
  • 100K<n<1M
  • Paper available
  • Image
  • Timeseries
  • Parquet

Robotwin2.0-lerobot

Published Apr 2026 · apache-2.0 · Traly

RoboTwin2.0 - LeRobot v3.0 RoboTwin2.0 converted to LeRobot v3.0 format. Contains both joint-space and end-effector (EE) pose data for bimanual manipulation across 5 robot embodiments.

  • 805 downloads
  • 100K<n<1M
  • Paper available

NIRANJAN_wire_hook_20260331

Published Apr 2026 · not specified · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 151, "total_frames": 89646, "total_tasks": 1, "chunks_size": 1000,…

  • 1,092 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

1stpass10_splatsim_approach_lever_benchmark_1000_fast

Published Mar 2026 · apache-2.0 · JennyWWW

notes to self: this is using the faster teleop (aka no velocity limiting) This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lerobot_splatsim", "total_episodes": 1012,…

  • 410 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Optimized Parquet

OmniAction

Published Mar 2026 · cc-by-nc-4.0 · rad1d1m123

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…

  • 63,389 downloads
  • not specified
  • Paper available

Master_wire_bend_varying_3wire_20260319_fix

Published Mar 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "chunks_size": 1000,…

  • 346 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

LeWAM_community_dataset

Published Mar 2026 · apache-2.0 · ehalicki

LeWAM Community Dataset A large-scale community-contributed robotics dataset for robot learning research featuring 321 datasets from the LeRobot community. Derived from Community Dataset v2, and converted to LeRobotDataset v3.0.

  • 3,000 downloads
  • 1 likes
  • n<1K
  • Video

maniparena-dataset

Published Mar 2026 · apache-2.0 · ManipArena

ManipArena Dataset Training dataset for ManipArena, a real-robot benchmark and competition for bimanual manipulation at the CVPR 2026 Embodied AI Workshop.

  • 30,062 downloads
  • 15 likes
  • not specified
  • Paper available
  • Gated access

level2_final_quality3

Published Mar 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks": 1, "chunks_size": 1000,…

  • 450 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-Manipulation-Kitchen-Demos

Published Mar 2026 · cc-by-4.0 · nvidia

PhysicalAI-Robotics-Manipulation-Kitchen-Demos We provide a 600 hours of human-teleoperated demonstrations across 316 different tasks, totalling 55k trajectories.

  • 30,417 downloads
  • 40 likes
  • not specified

so101-puzzle-v4

Published Mar 2026 · apache-2.0 · aivanni

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 37, "total_frames": 36234, "total_tasks": 1, "chunks_size": 1000,…

  • 658 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Published Mar 2026 · cc-by-4.0 · sini-21

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robot embodiments and tasks.

  • 126,607 downloads
  • not specified

computer-use-large

Published Mar 2026 · cc-by-4.0 · NeonoV1

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,389 downloads
  • 10K<n<100K

Master_wire_hook_20260327

Published Mar 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 128, "total_frames": 74668, "total_tasks": 1, "chunks_size": 1000,…

  • 337 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

Master_wire_hook_20260326

Published Mar 2026 · apache-2.0 · Factory-Intelligence

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 111, "total_frames": 62196, "total_tasks": 1, "chunks_size": 1000,…

  • 343 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

OmniAction

Published Mar 2026 · cc-by-nc-4.0 · OpenMOSS-Team

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…

  • 47,824 downloads
  • 279 likes
  • not specified
  • Paper available

Bench2Drive-Speed

Published Mar 2026 · cc-by-nc-nd-4.0 · rethinklab

Bench2Drive-Speed Project Page | Paper | GitHub Bench2Drive-Speed is a closed-loop benchmark for desired-speed conditioned autonomous driving, enabling explicit control over vehicle behavior through target speed and overtake/follow…

  • 3,478 downloads
  • not specified
  • Paper available

portafilter-regrasp-v6-slow-sticker-merged-remapped-clean

Published Mar 2026 · apache-2.0 · apaszynska

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 221, "total_frames": 203035, "total_tasks": 1, "chunks_size": 1000,…

  • 360 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_SingleArm_128x128

Published Mar 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].

  • 458 downloads
  • 3 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

InternData-A1

Published Mar 2026 · not specified · InternRobotics

InternData-A1 InternData-A1 is a hybrid synthetic-real manipulation dataset containing over 630k trajectories and 7,433 hours across 4 embodiments, 18 skills, 70 tasks, and 227 scenes, covering rigid, articulated, deformable, and…

  • 36,451 downloads
  • 88 likes
  • n>1T
  • Paper available
  • Gated access
  • 3d
  • Image

G1_WBT_Inspire_Put_Clothes_Into_Basket

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 1,395 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Brainco_Make_The_Bed

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 1,002 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Clothes_into_Washing_Machine

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 1,999 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Brainco_Pickup_Pillow

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 1,855 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Brainco_Collect_Plates_Into_Dishwasher

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 2,347 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Clothes_into_Washing_Machine_MainCamOnly

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 687 downloads
  • 5 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Pickup_Pillow_MainCamOnly

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 998 downloads
  • 6 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Collect_Clothes_MainCamOnly

Published Mar 2026 · apache-2.0 · unitreerobotics

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.

  • 1,143 downloads
  • 12 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

OmniAction-LIBERO

Published Mar 2026 · cc-by-nc-4.0 · OpenMOSS-Team

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…

  • 3,565 downloads
  • 69 likes
  • not specified
  • Paper available

DOMINO

Published Mar 2026 · apache-2.0 · H-EmbodVis

Towards Generalizable Robotic Manipulation in Dynamic Environments Heng Fang1, Shangru Li1, Shuhan Wang1, Xuanyang Xi2, Dingkang Liang1, Xiang Bai1 1 Huazhong University of Science and Technology, 2 Huawei Technologies Co.

  • 2,849 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available

pp_fold_fix

Published Mar 2026 · apache-2.0 · mjung11

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks": 4, "chunks_size": 1000,…

  • 352 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

RoboTwin_stack_bowls_three_randomized

Published Mar 2026 · apache-2.0 · SUZ-tsinghua

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 223728, "total_tasks": 343, "chunks_size": 1000, "data_files_size_in_mb":…

  • 343 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

RoboTwin_put_bottles_dustbin_randomized

Published Mar 2026 · apache-2.0 · SUZ-tsinghua

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 297520, "total_tasks": 500, "chunks_size": 1000, "data_files_size_in_mb":…

  • 1,043 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

full_ttt_redx_stable_fixed_short_labels

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…

  • 339 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

act_table_to_tray

Published Mar 2026 · apache-2.0 · Fiberal

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 50, "total_frames": 52868, "total_tasks": 1, "chunks_size": 1000,…

  • 724 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

full_ttt_redx_stable_fixed_labels

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…

  • 321 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

MarketGen

Published Mar 2026 · cc-by-nc-sa-4.0 · HXX

MarketGen : A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments 📑 arXiv | 🌐 Project | 🤗 Hugging Face MarketGen is a scalable simulation platform with automatic scene generation for embodied supermarket…

  • 2,838 downloads
  • 1 likes
  • not specified
  • Paper available
  • Image

360DVO

Published Mar 2026 · cc-by-nc-sa-4.0 · chris1004336379

360DVO Dataset This is the official dataset from the paper 360DVO: Deep Visual Odometry for Monocular 360-Degree Camera, which is published on IEEE Robotics and Automation Letters (RA-L) 2026.

  • 1,013 downloads
  • 4 likes
  • 1K<n<10K
  • Paper available
  • Image
  • Imagefolder

humanoid-robots-training-dataset

Published Mar 2026 · cc-by-nc-4.0 · DynamicIntelligence

Dynamic Intelligence — Humanoid Robot Training Dataset A first-person (egocentric) video dataset of human hand manipulation, designed for training humanoid robot policies via imitation learning.

  • 407 downloads
  • 10K<n<100K
  • Tabular
  • Text
  • Parquet

pp_fold

Published Mar 2026 · apache-2.0 · mjung11

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks": 4, "chunks_size": 1000,…

  • 320 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

logu_bimanual_joint_multi_task

Published Mar 2026 · apache-2.0 · gaozj

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 212, "total_frames": 101703, "total_tasks": 2, "total_videos": 636, "total_chunks": 1,…

  • 363 downloads
  • n<1K
  • Video

computer-use-large

Published Mar 2026 · cc-by-4.0 · Whiteglove44

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,547 downloads
  • 10K<n<100K

computer-use-large

Published Mar 2026 · cc-by-4.0 · nawed

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,488 downloads
  • 10K<n<100K

redx_stable

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…

  • 338 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

mimic_ttt_redx_ALLBAL_15hz_stable

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 18, "chunks_size": 1000,…

  • 342 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

computer-use-large

Published Mar 2026 · cc-by-4.0 · REXX-NEW

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 37,008 downloads
  • 10K<n<100K

computer-use-large

Published Mar 2026 · cc-by-4.0 · AGuyWithAnAI

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,435 downloads
  • 10K<n<100K

computer-use-xl

Published Mar 2026 · cc-by-4.0 · AGuyWithAnAI

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,362 downloads
  • 10K<n<100K

mimic_ttt_redx_30hz_x2_ALL

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 338, "total_frames": 364063, "total_tasks": 11, "chunks_size": 1000,…

  • 443 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

ManiTwin-100K

Published Mar 2026 · apache-2.0 · Dhanush944

ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research.

  • 12,690 downloads
  • 100K<n<1M
  • Paper available

FuSe

Published Mar 2026 · mit · riverfog7

The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments, annotated with natural language.

  • 3,274 downloads
  • not specified
  • Paper available

robomimic-pretrain-data

Published Mar 2026 · not specified · wintermelontree

From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning (DICE-RL) This repository contains the datasets used in the paper From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning.

  • 450 downloads
  • 2 likes
  • not specified
  • Paper available

ATG-MoE_TrainingSet

Published Mar 2026 · mit · failo0711

ATG-MoE Pressure-Reducing Valve Assembly Training Set This repository provides the training set used in our paper: "ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning".

  • 1,913 downloads
  • n<1K
  • Paper available

ur10e_real_cube_merge

Published Mar 2026 · apache-2.0 · cupnb

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ros2", "total_episodes": 357, "total_frames": 334477, "total_tasks": 4, "chunks_size": 1000, "data_files_size_in_mb":…

  • 406 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

merged_diffrenet_grips

Published Mar 2026 · apache-2.0 · apaszynska

merged_diffrenet_grips Dataset for RECAP (RL with Experience and Corrections via Advantage-conditioned Policies) training.

  • 383 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

UrbanVerse-100K

Published Mar 2026 · odc-by · Oatmealliu

UrbanVerse-100K Dataset [!NOTE] UrbanVerse-100K is a large-scale, physics-aware 3D asset and material database curated for urban simulation, physical and embodied AI research.

  • 3,530 downloads
  • 20 likes
  • 100K<n<1M
  • Paper available
  • Gated access
  • 3d

oxford_spires_dataset

Published Mar 2026 · cc-by-nc-sa-4.0 · ori-drs

We present the Oxford Spires Dataset, captured in and around well-known landmarks in Oxford using a custom-built multi-sensor perception unit as well as a millimetre-accurate map from a terrestrial LiDAR scanner (TLS).

  • 3,196 downloads
  • 6 likes
  • not specified
  • Paper available

dAgger_bin_pick_pack_coffee_capsules_1.5.0

Published Mar 2026 · apache-2.0 · villekuosmanen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 80, "total_frames": 24266, "total_tasks": 1, "chunks_size": 1000,…

  • 456 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

bin_pick_pack_coffee_capsules

Published Mar 2026 · apache-2.0 · villekuosmanen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 200, "total_frames": 47865, "total_tasks": 1, "chunks_size": 1000,…

  • 739 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

mimic_ttt_redBalanced_30hz

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 340, "total_frames": 289608, "total_tasks": 9, "chunks_size": 1000,…

  • 362 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_DualArm_128x128

Published Mar 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].

  • 736 downloads
  • 2 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_DualArm_256x256

Published Mar 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].

  • 476 downloads
  • 2 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

SynthVerse

Published Mar 2026 · cc-by-nc-sa-4.0 · InternRobotics

Viewer is explicitly configured to read the parquet split only. Citation If you find this dataset useful, please cite: @article{zhao2026SythnVerse, title={SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking},…

  • 972 downloads
  • 13 likes
  • 1K<n<10K
  • Paper available
  • Image
  • Text
  • Optimized Parquet

ArtVIP

Published Mar 2026 · apache-2.0 · X-Humanoid

ArtVIP dataset card 🎉🎉🎉 ArtVIP is accepted by ICLR 2026 Key Features ✅ 206 high-quality digital-twin articulated objects.

  • 15,632 downloads
  • 44 likes
  • not specified
  • Paper available

scenesmith-example-scenes

Published Mar 2026 · apache-2.0 · nepfaff

SceneSmith Example Scenes Project Page | Paper | Code Example scenes generated by SceneSmith, a hierarchical agentic framework for constructing simulation-ready indoor environments from natural language prompts.

  • 4,597 downloads
  • 6 likes
  • 1K<n<10K
  • Paper available

computer-use-large

Published Mar 2026 · cc-by-4.0 · markov-ai

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 28,380 downloads
  • 172 likes
  • 10K<n<100K

mimic_simpleClothFolding_v1

Published Mar 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 222, "total_frames": 384490, "total_tasks": 1, "chunks_size": 1000,…

  • 388 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

aloha_static_battery_ep005_009

Published Mar 2026 · apache-2.0 · macrodata

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 527 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

aloha_static_battery_ep000_004

Published Mar 2026 · apache-2.0 · macrodata

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 524 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

syn_realDataset

Published Mar 2026 · mit · Lusmse

Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to-synthetic data ratios.

  • 893 downloads
  • not specified

computer-use-large

Published Mar 2026 · cc-by-4.0 · bitsydarel

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.

  • 3,496 downloads
  • 10K<n<100K

Edge-Agent-Reasoning-WebSearch-260K

Published Mar 2026 · mit · yatin-superintelligence

Edge Agent Reasoning WebSearch 260K Abstract The Edge-Agent-Reasoning-WebSearch-260K dataset is a massive, synthetically expert-engineered corpus of over 700 Million tokens, designed to train small, local models (SLMs) and edge-deployed…

  • 5,993 downloads
  • 44 likes
  • 100K<n<1M
  • 3d
  • Audio
  • Parquet

sd

Published Mar 2026 · mit · Lusmse

Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to-synthetic data ratios.

  • 446 downloads
  • not specified

bi_so101_baseline_4cam

Published Mar 2026 · apache-2.0 · Zekai-Chen

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 303, "total_frames": 373511, "total_tasks": 1, "chunks_size": 1000,…

  • 340 downloads
  • 100K<n<1M
  • Tabular
  • Text
  • Parquet

test_dataset

Published Mar 2026 · apache-2.0 · RonPlusSign

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "dianarm", "total_episodes": 26, "total_frames": 6847, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…

  • 348 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

HuMI-Raw-Data

Published Mar 2026 · cc-by-4.0 · Richard-Nai

Raw Data for HuMI [Project Page] | [Paper] Dataset Summary This dataset contains raw data collected using the HuMI data collection pipeline. It includes MP4 videos and tracker trajectories.

  • 1,511 downloads
  • not specified
  • Paper available

trlc_tshirt_folding

Published Mar 2026 · apache-2.0 · Gongsta

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 532, "total_frames": 1106053, "total_tasks": 1, "chunks_size": 1000,…

  • 694 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

rbm-1m-ood

Published Mar 2026 · apache-2.0 · robometer

RBM-1M-OOD evaluation dataset used in Robometer. It contains over 1k trajectories used for evaluation of general-purpose reward models.

  • 390 downloads
  • 1 likes
  • 1K<n<10K
  • Paper available
  • Text
  • Video

koch_pick_place_5_lego_random_pose

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 36013, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…

  • 627 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

koch_pick_place_1_lego

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 39693, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…

  • 636 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_gnm_recon

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 11834, "total_frames": 610907, "total_tasks": 1, "chunks_size": 1000, "fps": 3, "splits": {…

  • 1,039 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

berkeley_cable_routing

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1647, "total_frames": 42328, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…

  • 2,147 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_static_thread_velcro

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 34300, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 493 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_static_fork_pick_up

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 60000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 846 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_static_battery

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 29400, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 652 downloads
  • 1 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_shrimp

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 18, "total_frames": 67500, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 1,211 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

umi_cup_in_the_wild

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1447, "total_frames": 699432, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…

  • 642 downloads
  • 6 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

usc_cloth_sim

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 100000, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…

  • 1,471 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

imperialcollege_sawyer_wrist_cam

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 170, "total_frames": 7148, "total_tasks": 17, "chunks_size": 1000, "fps": 5, "splits": {…

  • 561 downloads
  • 2 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

utokyo_xarm_pick_and_place

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 7490, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…

  • 499 downloads
  • 1 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

level2_final_quality3_t_0_hil_data_c

Published Mar 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks": 1, "chunks_size": 1000,…

  • 1,488 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

roboturk

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1995, "total_frames": 187507, "total_tasks": 3, "chunks_size": 1000, "fps": 10, "splits": {…

  • 613 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

libero_spatial_image

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…

  • 1,293 downloads
  • 7 likes
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

libero_object_image

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 454, "total_frames": 66984, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…

  • 1,236 downloads
  • 2 likes
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

libero_goal_image

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 428, "total_frames": 52042, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…

  • 678 downloads
  • 5 likes
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

libero_10_image

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…

  • 1,450 downloads
  • 4 likes
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

fmb

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1804, "total_frames": 338188, "total_tasks": 24, "chunks_size": 1000, "fps": 10, "splits": {…

  • 859 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

conq_hose_manipulation

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 139, "total_frames": 8277, "total_tasks": 3, "chunks_size": 1000, "fps": 30, "splits": {…

  • 880 downloads
  • 1 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

IndustryShapes

Published Mar 2026 · mit · POSE-Lab

IndustryShapes Project Page | Paper IndustryShapes is a new benchmark dataset tailored for 6D object pose estimation in industrial settings.

  • 675 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Text
  • Parquet

berkeley_gnm_sac_son

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 2955, "total_frames": 241059, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…

  • 957 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

berkeley_gnm_cory_hall

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 7331, "total_frames": 156012, "total_tasks": 1, "chunks_size": 1000, "fps": 5, "splits": {…

  • 564 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

aloha_static_tape

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 505 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

RoboTwin_clean

Published Mar 2026 · apache-2.0 · SUZ-tsinghua

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 2500, "total_frames": 549787, "total_tasks": 2409, "chunks_size": 1000,…

  • 1,012 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

kumagong

Published Mar 2026 · cc-by-nc-sa-4.0 · balatubs123

KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task_A Single task Initial state: T-shirts…

  • 78,972 downloads
  • not specified

level2_final_quality3_trim_0_hil_data

Published Mar 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks": 1, "chunks_size": 1000,…

  • 1,631 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

level2_final_quality3_trim

Published Mar 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1197, "total_frames": 3146294, "total_tasks": 1, "chunks_size": 1000,…

  • 1,436 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Published Mar 2026 · cc-by-4.0 · nvidia

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robot embodiments and tasks.

  • 350,718 downloads
  • 223 likes
  • not specified

justaddforce-data

Published Mar 2026 · apache-2.0 · correlllab

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 130, "total_frames": 4435, "total_tasks": 36, "chunks_size": 1000, "data_files_size_in_mb": 100,…

  • 325 downloads
  • 1 likes
  • not specified

xarm_push_medium_replay_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 2,626 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

xarm_push_medium_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 2,661 downloads
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

xarm_lift_medium_replay_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 2,650 downloads
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

xarm_lift_medium_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 2,643 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

pusht_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 13,329 downloads
  • 6 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

metaworld_mt50

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…

  • 8,738 downloads
  • 17 likes
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_scripted_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 9,650 downloads
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_scripted

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 12,120 downloads
  • 6 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_human_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 9,767 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_human

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…

  • 26,167 downloads
  • 11 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_sim_insertion_scripted_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 9,610 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

aloha_sim_insertion_scripted

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…

  • 14,048 downloads
  • 5 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_sim_insertion_human_image

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…

  • 9,832 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

aloha_sim_insertion_human

Published Mar 2026 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…

  • 16,413 downloads
  • 13 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

libero

Published Mar 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks":40, "chunks_size": 1000, "data_files_size_in_mb":…

  • 4,453 downloads
  • 9 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Kai0

Published Mar 2026 · cc-by-nc-sa-4.0 · OpenDriveLab-org

KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task_A Single task Initial state: T-shirts…

  • 12,110 downloads
  • 34 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

SandGO

Published Mar 2026 · apache-2.0 · XDUImageLab

SandGO Dataset (v1.0) SandGO 是一个专为具身智能 (Embodied AI) 和机器人学习设计的 long-horizon 多模态数据集。该数据集由原始 MarsMind_data 经过精炼、去冗余和结构化处理后得到,旨在支持 长序列决策、指令跟随 和 多模态模仿学习 等任务。数据集包含完整的状态‑动作序列、多视角 RGB 图像、LiDAR 点云、深度点云 以及对应的 自然语言指令,为训练 Vision-Language-Action (VLA)…

  • 1,825 downloads
  • 1 likes
  • not specified
  • Image

SandThink

Published Mar 2026 · mit · XDUImageLab

SandThink Dataset (v1.0) SandThink 是一个专为具身智能 (Embodied AI) 任务设计的大规模指令微调与偏好对齐数据集。该数据集通过结构化的 Chain-of-Thought (CoT) 推理过程,显著提升了 Vision-Language-Action (VLA) 模型在复杂环境下的任务拆解、路径规划和动作执行能力。 📊 数据集概览 (Dataset Summary) 本数据集包含三个核心组件,总计约 37,000 条高质量数据:…

  • 6,566 downloads
  • n<1K
  • Image
  • Imagefolder

humanoid-everyday

Published Mar 2026 · apache-2.0 · FedorX8

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.

  • 10,991 downloads
  • 1K<n<10K
  • Paper available
  • Video

connect4_large-1-recut

Published Mar 2026 · apache-2.0 · tomo202

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 108, "total_frames": 16645, "total_tasks": 1, "chunks_size": 1000,…

  • 686 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

MOSAIC_Dataset

Published Mar 2026 · cdla-permissive-2.0 · BAAI-Humanoid

MOSAIC Dataset Project Page | Paper | Code | Dataset | Model This repository releases the built-in MOSAIC multi-source motion dataset in the following paper: MOSAIC: Bridging the Sim-to-Real Gap in Generalist Humanoid Motion Tracking and…

  • 826 downloads
  • 7 likes
  • n<1K
  • Paper available
  • Document
  • Text
  • Text

dk1_2026-03-03

Published Mar 2026 · apache-2.0 · Gongsta

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 132, "total_frames": 273043, "total_tasks": 1, "chunks_size": 1000,…

  • 1,346 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

dk1_2026-03-02

Published Mar 2026 · apache-2.0 · Gongsta

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 61, "total_frames": 120265, "total_tasks": 1, "chunks_size": 1000,…

  • 622 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

dk1_2026-03-01

Published Mar 2026 · apache-2.0 · Gongsta

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 141, "total_frames": 306425, "total_tasks": 1, "chunks_size": 1000,…

  • 1,653 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

high_quality_folding

Published Feb 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks": 1, "chunks_size": 1000,…

  • 20,763 downloads
  • 2 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

language_table_lerobot

Published Feb 2026 · apache-2.0 · FedorX8

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605, "chunks_size": 1000, "fps": 10,…

  • 318 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot

Published Feb 2026 · apache-2.0 · FedorX8

This dataset was created using LeRobot. Dataset Description Converted BridgeV2 dataset to lerobot format. Downloaded from [https://rail.eecs.berkeley.edu/datasets/bridge_release/data/] (only demos_8_17.zip, not scripted_6_18.zip) Dataset…

  • 1,247 downloads
  • 1M<n<10M
  • Tabular
  • Text
  • Parquet

VLA_Arena_L0_S_lerobot_openpi

Published Feb 2026 · apache-2.0 · VLA-Arena

VLA-Arena Dataset (L0 - Small Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.

  • 339 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Optimized Parquet

VLA_Arena_L0_L_lerobot_openpi

Published Feb 2026 · apache-2.0 · VLA-Arena

VLA-Arena Dataset (L0 - Large Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.

  • 1,589 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Optimized Parquet

wildos

Published Feb 2026 · apache-2.0 · leggedrobotics

WildOS Frontiers Dataset Dataset Description This dataset provides visual frontier annotations for outdoor long-range navigation, created for WildOS: Open-Vocabulary Object Search in the Wild.

  • 945 downloads
  • 2 likes
  • 1K<n<10K
  • Paper available
  • Image
  • Imagefolder

soarm101_pickplace_10d_240e_closed

Published Feb 2026 · apache-2.0 · edgarcancinoe

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 240, "total_frames": 180027, "total_tasks": 1, "chunks_size": 1000,…

  • 326 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

bridge_data_v2_teleop

Published Feb 2026 · cc-by-4.0 · jnogga

BridgeData V2 Teleoperated Demos Teleoperated demonstrations in BridgeData V2. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~86% of all teleoperated episodes.

  • 1,092 downloads
  • 1M<n<10M

L2D-v3

Published Feb 2026 · apache-2.0 · peak888

This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.

  • 2,811 downloads
  • not specified
  • Video

RealSource-World

Published Feb 2026 · cc-by-nc-sa-4.0 · RealSourceData

RealSource World RealSource World is a large-scale real-world robotics manipulation dataset collected using RS-02 dual-arm humanoid robot.

  • 10,280 downloads
  • 13 likes
  • 10M<n<100M
  • Gated access

droid_failure

Published Feb 2026 · cc-by-4.0 · jnogga

DROID Failure Episodes Failure episodes in DROID-COMMUNITY. Episodes with inconsistent data were discarded, leaving ~91% of all failure episodes.

  • 3,832 downloads
  • 1M<n<10M

droid_success

Published Feb 2026 · cc-by-4.0 · jnogga

DROID Success Episodes Successful episodes in DROID-COMMUNITY. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~89% of all successful episodes.

  • 16,949 downloads
  • 10M<n<100M

RoboChallenge-lerobot-merged

Published Feb 2026 · apache-2.0 · Traly

RoboChallenge-lerobot-merged Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset (merged version) in LeRobot v3.0 format.

  • 12,422 downloads
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

RoboInter-Data

Published Feb 2026 · not specified · InternRobotics

RoboInter-Data: Intermediate Representation Annotations for Robot Manipulation Rich, dense, per-frame intermediate representation annotations for robot manipulation, built on top of DROID and RH20T.

  • 5,682 downloads
  • 9 likes
  • 1K<n<10K
  • Paper available
  • Text
  • Video
  • JSON

droid_v3

Published Feb 2026 · apache-2.0 · DAVIAN-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…

  • 6,517 downloads
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

SafeFlowMPC

Published Feb 2026 · mit · ThiesOelerich

SafeFlowMPC Finetuning Dataset This repository contains the finetuning dataset used in the paper SafeFlowMPC: Predictive and Safe Trajectory Planning for Robot Manipulators with Learning-based Policies.

  • 1,156 downloads
  • 100K<n<1M
  • Paper available
  • Optimized Parquet

DECO-50

Published Feb 2026 · apache-2.0 · BAAI-Humanoid

DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter DECO-50 is a bimanual dexterous manipulation dataset with tactile sensing, comprising 50 hours of teleoperated data across 4…

  • 27,922 downloads
  • 4 likes
  • 10M<n<100M
  • Paper available
  • Image
  • Text
  • Webdataset

full_folding

Published Feb 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5688, "total_frames": 14129038, "total_tasks": 1, "chunks_size": 1000,…

  • 15,605 downloads
  • 3 likes
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

carla-dataset-ped

Published Feb 2026 · cc-by-4.0 · mkxdxd

CARLA Stage 2 Pedestrian Dataset A large-scale driving dataset (Stage 2) focused on pedestrians, captured from CARLA simulator.

  • 4,300 downloads
  • 100K<n<1M

droid_success_high_quality

Published Feb 2026 · cc-by-4.0 · jnogga

DROID Success (High-Quality Extrinsics) Subset of ~17k successful episodes in DROID-COMMUNITY filtered for high-quality camera extrinsics.

  • 6,916 downloads
  • 1 likes
  • 1M<n<10M

RoboInter-VQA

Published Feb 2026 · not specified · InternRobotics

RobotInter-VQA: Intermediate Representation Understanding & Generation VQA Dataset for Manipulation English | 简体中文 A Visual Question Answering dataset for robotic manipulation, developed as part of the RoboInter, covering generation,…

  • 1,429 downloads
  • 6 likes
  • not specified
  • Paper available

nemotron-vla-metaworld

Published Feb 2026 · mit · keivalya

Nemotron-VLA MetaWorld Expert Demonstrations Expert demonstration dataset for training Vision-Language-Action (VLA) models on MetaWorld robot manipulation tasks. Built for the Nemotron-VLA project.

  • 549 downloads
  • 1 likes
  • 100K<n<1M
  • Text
  • Timeseries
  • Parquet

RoboChallenge-lerobot

Published Feb 2026 · apache-2.0 · Traly

RoboChallenge-lerobot Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset in LeRobot v3.0 format.

  • 6,361 downloads
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

mimic_tictactoe_redx_full30hz

Published Feb 2026 · apache-2.0 · Mimic-Robotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 554, "total_frames": 428498, "total_tasks": 11, "chunks_size": 1000,…

  • 608 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

dagger_final_1_21

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1, "total_frames": 841, "total_tasks": 1, "chunks_size": 1000,…

  • 1,993 downloads
  • n<1K
  • Tabular
  • Timeseries
  • Parquet

retail_restock_11_feb_no_verb

Published Feb 2026 · apache-2.0 · tma-hiverobots

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Unitree_G1_Dex3", "total_episodes": 124, "total_frames": 13344, "total_tasks": 3, "chunks_size": 1000,…

  • 372 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

level2_final_quality2_augmented

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 2366, "total_frames": 6205242, "total_tasks": 1, "chunks_size": 1000,…

  • 3,360 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

level2_final_quality2

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1183, "total_frames": 3102621, "total_tasks": 1, "chunks_size": 1000,…

  • 1,753 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

level1_final_quality0

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 631, "total_frames": 1055899, "total_tasks": 2, "chunks_size": 1000,…

  • 566 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

bridge_orig_lerobot

Published Feb 2026 · apache-2.0 · arekborucki

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 54,264 downloads
  • not specified

level12_rac_2_2026-02-09

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 537, "total_frames": 1835987, "total_tasks": 2, "chunks_size": 1000,…

  • 1,066 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

VITRA-TeleData

Published Feb 2026 · mit · microsoft

VITRA Teleoperation Dataset Dataset Summary This dataset contains real-world robot teleoperation demonstrations collected using a 7-DoF robotic arm equipped with a dexterous hand and a head-mounted RGB camera.

  • 1,120 downloads
  • 1 likes
  • 1K<n<10K
  • Paper available

level12_quality0_2026-02-08

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 258, "total_frames": 595888, "total_tasks": 1, "chunks_size": 1000,…

  • 499 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

level12_rac_2_2026-02-07_and_mir

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 10770, "total_frames": 26748966, "total_tasks": 1, "chunks_size": 1000,…

  • 18,546 downloads
  • 1 likes
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

threeBall_origin

Published Feb 2026 · apache-2.0 · twoeight

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 101, "total_frames": 247024, "total_tasks": 2, "chunks_size": 1000,…

  • 342 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

level12_rac_2_2026-02-07

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5551, "total_frames": 13805156, "total_tasks": 1, "chunks_size": 1000,…

  • 13,951 downloads
  • 2 likes
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

level12_rac_2_2026-02-05_ag

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5413, "total_frames": 13497374, "total_tasks": 1, "chunks_size": 1000,…

  • 9,997 downloads
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

BimanualUR5eExample

Published Feb 2026 · apache-2.0 · robotics-diffusion-transformer

Dataset Summary This dataset provides shards in the WebDataset format for fine-tuning RDT-2 or other policy models on bimanual manipulation.

  • 872 downloads
  • 3 likes
  • 100K<n<1M
  • Paper available
  • Image
  • Text
  • Webdataset

3d_optical_flow_droid

Published Feb 2026 · mit · Salesforce

3D Optical Flow DROID Dataset Processed DROID robotics dataset with optical flow and scene flow annotations. Dataset Structure Organized by lab, each trajectory in separate tar.gz archive:…

  • 14,567 downloads
  • 10M<n<100M
  • Image
  • Text
  • Webdataset

InternData-N1

Published Feb 2026 · cc-by-sa-4.0 · InternRobotics

InternData-N1 InternData-N1 is a large-scale, unified vision-language navigation dataset consolidating multiple benchmarks into a standardized format.

  • 47,044 downloads
  • 68 likes
  • n>1T
  • Gated access

refiner-utn-pick-cuboid-sim-v1

Published Feb 2026 · apache-2.0 · Juelg

Simulated Franka Pick-Cube Tactile Dataset The dataset was generated using the Robot Control Stack (RCS). RCS is a flexible Gymnasium wrapper-based robot control interface made for robot learning and specifically Vision-Language-Action…

  • 783 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Image
  • Timeseries
  • Parquet

level12_rac_2_2026-02-04

Published Feb 2026 · apache-2.0 · lerobot-data-collection

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5357, "total_frames": 13251592, "total_tasks": 1, "chunks_size": 1000,…

  • 9,734 downloads
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

furniture_bench_dataset_lerobot

Published Feb 2026 · apache-2.0 · FedorX8

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "chunks_size": 1000, "fps": 10, "splits": {…

  • 411 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

InternScenes

Published Feb 2026 · not specified · InternRobotics

InternScenes InternScenes is a large-scale interactive indoor scene dataset with realistic layouts. This dataset comprises approximately 40,000 diverse scenes and 1.96M 3D objects that cover 15 common scene types and 288 object classes,…

  • 16,673 downloads
  • 36 likes
  • 1M<n<10M
  • Gated access
  • 3d
  • Image
  • Webdataset

Transparent_BOP

Published Feb 2026 · cc-by-4.0 · varunburde

Object Pose Estimation Using Implicit Representation for Transparent Objects This dataset aggregates high quality 3D mesh assets and rendered data for training and fine-tuning pose estimation models.

  • 3,902 downloads
  • 100K<n<1M

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

Published Feb 2026 · cc-by-4.0 · kze193825

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.

  • 22,090 downloads
  • n>1T
  • Paper available

FreeTacMan

Published Feb 2026 · mit · OpenDriveLab

📦 FreeTacman Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation [ICRA 2026] 🎯 Overview This dataset supports the paper FreeTacman: Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation.

  • 13,952 downloads
  • 4 likes
  • not specified
  • Paper available
  • Video

coop_alldata

Published Jan 2026 · apache-2.0 · lucanunz

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 980, "total_frames": 106944, "total_tasks": 10, "total_videos": 3920, "total_chunks": 1,…

  • 478 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

UrbanNav

Published Jan 2026 · mit · Vigar001

UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories Yanghong Mei*1,5, Yirong Yang*2, Longteng Guo†1, Qunbo Wang3, Ming-Ming Yu2, Xingjian He1, Wenjun Wu2,4, Jing Liu1,5, 1Institute of Automation, Chinese…

  • 533 downloads
  • not specified
  • Paper available

libero_10_subtasks

Published Jan 2026 · apache-2.0 · KeWangRobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "libero_panda", "total_episodes": 500, "total_frames": 138090, "total_tasks": 10, "chunks_size": 1000,…

  • 555 downloads
  • 100K<n<1M
  • Image
  • Text
  • Parquet

G1_Dex1_Clean_Table

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,220 downloads
  • 14 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DualRobot_Clean_Table

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,098 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Erase_Board

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,277 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Fold_Towel

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,419 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Pack_PencilBox

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,316 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Organize_Tools

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 802 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Pack_PingPong

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,476 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Wipe_Table

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 989 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Stack_Block

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,343 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Pour_Medicine

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 867 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Prepare_Fruit

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 634 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Bag_Insert

Published Jan 2026 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 1,000 downloads
  • 2 likes
  • not specified
  • Video

DR_dataset

Published Jan 2026 · mit · ganatrask

DR_dataset Simulated robot manipulation dataset for imitation learning. Dataset Description This dataset contains 200 episodes of a Trossen WXAI robotic arm performing food transfer tasks in MuJoCo simulation.

  • 3,206 downloads
  • 1K<n<10K

SimScale

Published Jan 2026 · apache-2.0 · OpenDriveLab

Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Tian ([email protected]) 📜…

  • 3,007 downloads
  • 1 likes
  • n>1T
  • Paper available

SimScale

Published Jan 2026 · apache-2.0 · OpenDriveLab-org

Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Tian ([email protected]) 📜…

  • 1,537 downloads
  • 1 likes
  • 10M<n<100M
  • Paper available
  • Image
  • Text
  • Webdataset

RBench

Published Jan 2026 · cc-by-4.0 · DAGroup-PKU

Rethinking Video Generation Model for the Embodied World 🔍 Benchmark Overview The benchmark is constructed from two complementary perspectives: task categories and robot embodiment types, covering a total of 650 image-text evaluation…

  • 554 downloads
  • 3 likes
  • not specified
  • Paper available

ArtVIP

Published Jan 2026 · apache-2.0 · dtttttiiiii

ArtVIP dataset card Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Modular interaction ✅ Physics fidelity ✅ Pixel-level affordance annotations…

  • 575 downloads
  • not specified
  • Paper available

RPD-maniskill

Published Jan 2026 · apache-2.0 · Juelg

Maniskill Sub-Dataset in RLDS Format used in RPD This repository contains the maniskill subset in RLDS format used to train Octo and OpenVLA in the paper Refined Policy Distillation: From VLA Generalists to RL Experts, which distilled…

  • 374 downloads
  • 1 likes
  • not specified
  • Paper available

libero_10

Published Jan 2026 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks_size": 1000, "data_files_size_in_mb":…

  • 1,076 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

openarm_pick_v6

Published Jan 2026 · apache-2.0 · AiSaurabhPatil

OpenArm Pick v6 - LeRobot Dataset A LeRobot v2.1 dataset for bimanual robot manipulation, recorded in Isaac Sim with teleoperation.

  • 462 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pour-cup

Published Jan 2026 · cc-by-4.0 · BeingBeyond

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

  • 317 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pick-simple_speed2x

Published Jan 2026 · cc-by-4.0 · BeingBeyond

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

  • 650 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pick-simple

Published Jan 2026 · cc-by-4.0 · BeingBeyond

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

  • 1,030 downloads
  • 4 likes
  • 1M<n<10M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

wanderland

Published Jan 2026 · apache-2.0 · ai4ce

Wanderland Dataset Dataset Description Wanderland is a large-scale urban dataset designed for geometrically grounded simulation and open-world embodied AI research.

  • 5,814 downloads
  • 4 likes
  • 100K<n<1M
  • Paper available
  • Gated access
  • 3d
  • Text
  • Text

AgiBotWorld-Alpha-lerobot

Published Jan 2026 · apache-2.0 · Traly

Note This is an unofficial implementation. This dataset was converted from AgiBotWorld-Alpha into the LeRobot format using https://github.com/Tavish9/any4lerobot/tree/main/agibot2lerobot Configurations Each sub-task is available as a…

  • 3,607 downloads
  • 3 likes
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

taco_play_lerobot

Published Jan 2026 · apache-2.0 · ygtxr1997

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "total_videos": 6484, "total_chunks": 4,…

  • 625 downloads
  • not specified
  • Video

so101_pick_cube_chunked

Published Jan 2026 · apache-2.0 · gpudad

SO101 Pick Cube Dataset (Chunked) This is a restructured version of the gpudad/so101_pick_cube dataset with episode-level video files for faster data loading during training. Why Chunked?

  • 672 downloads
  • 1M<n<10M
  • Tabular
  • Video
  • Parquet

bc_z_lerobot

Published Jan 2026 · apache-2.0 · ygtxr1997

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 104, "total_videos": 39350,…

  • 360 downloads
  • not specified
  • Video

fractal20220817_data_lerobot

Published Jan 2026 · apache-2.0 · ygtxr1997

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 599, "total_videos": 87212,…

  • 3,307 downloads
  • not specified

bridge_orig_lerobot

Published Jan 2026 · apache-2.0 · ygtxr1997

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 1,477 downloads
  • not specified
  • Video

grand_tour_dataset

Published Jan 2026 · mit · leggedrobotics

The GrandTour Dataset A project brought to you by RSL - ETH Zurich. References • Contributing • Citation References Official dataset webpage: grand-tour.leggedrobotics.com Getting started & examples:…

  • 9,615 downloads
  • 15 likes
  • 1M<n<10M

NitroGen

Published Jan 2026 · cc-by-nc-4.0 · nvidia

Website | Model | Dataset | Code | Paper NitroGen Dataset Dataset Description: The NitroGen dataset contains action annotations for publicly available gameplay videos.

  • 1,096 downloads
  • 201 likes
  • not specified
  • Paper available

TraceSpatial-Bench

Published Jan 2026 · apache-2.0 · JingkunAn

TraceSpatial-Bench: An Object-Centric 3D Trajectory Planning Benchmark Welcome to TraceSpatial-Bench, an object-centric 3D spatial trace planning benchmark provided by RoboTracer.TraceSpatial-Bench is the first benchmark that evaluates…

  • 1,159 downloads
  • 4 likes
  • n<1K
  • Paper available
  • Image
  • Text
  • Optimized Parquet

bridgev2

Published Jan 2026 · apache-2.0 · Saberlve

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 60,561 downloads
  • not specified

AAWR-DROID

Published Jan 2026 · mit · Everloom

Active Perception AAWR Dataset in GRASP Lab Mock Kitchen Paper | Project page | Code Here is our full dataset (13.5GB) in DROID format: https://huggingface.co/datasets/Everloom/AAWR-DROID/ It includes 4 scene as below, each are cleaned…

  • 1,290 downloads
  • 10B<n<100B
  • Paper available
  • Image

humanoid-everyday

Published Jan 2026 · apache-2.0 · Fizzi789

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.

  • 691 downloads
  • 1K<n<10K
  • Paper available
  • Video

comet-1.5k

Published Jan 2026 · mit · delinqu

Comet-1.5k — RFT Dataset (Openpi-Comet) This repository releases Comet-1.5k, an RFT (Rejection Sampling Fine-Tuning) dataset generated by Team Comet’s post-training pipeline on the BEHAVIOR-1K simulator (OmniGibson).It is intended to…

  • 6,384 downloads
  • 4 likes
  • 1K<n<10K
  • Paper available

Lightwheel-Tasks-X7S

Published Dec 2025 · apache-2.0 · LightwheelAI

This dataset was created using LeRobot. 1 Dataset Description This dataset includes 117 Lightwheel-Libero-Tasks and 89 Lightwheel-Robocasa-Tasks, collected using the x7s robot in environments provided by LW-BenchHub.The robot configuration…

  • 372 downloads
  • 10M<n<100M
  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot

Published Dec 2025 · apache-2.0 · ryanqian1994

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 4,422 downloads
  • 1 likes
  • not specified

VLA_Arena_L1_M_lerobot_smolvla

Published Dec 2025 · apache-2.0 · VLA-Arena

VLA-Arena Dataset (L1 - Medium Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.

  • 339 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Optimized Parquet

AgiBotWorld-Beta_G1_task_463_Clean_the_microwave_and_range_hood

Published Dec 2025 · not specified · BAAI-DataCube

agibot_task_463 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 清洁微波炉和抽油烟机。 total_episodes: 1780 total_tasks: 1 size: 57G Dataset Structure ├── data │ └──…

  • 419 downloads
  • n<1K
  • Video

AgiBotWorld-Beta_G1_task_537_make_the_bed

Published Dec 2025 · not specified · BAAI-DataCube

agibot_task_537 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 整理床铺 total_episodes: 751 total_tasks: 1 size: 64G Dataset Structure ├── data │ └── chunk-xxx │…

  • 411 downloads
  • n<1K
  • Video

AgiBot-g1_picks_up_parts_b

Published Dec 2025 · apache-2.0 · RoboCOIN

AgiBot-g1_picks_up_parts_b 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.

  • 10,455 downloads
  • 100K<n<1M
  • Paper available
  • Gated access
  • Tabular
  • Timeseries
  • Parquet

Cobot_Magic_make_hamburger

Published Dec 2025 · apache-2.0 · RoboCOIN

Cobot_Magic_make_hamburger 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.

  • 6,044 downloads
  • 1M<n<10M
  • Paper available
  • Gated access
  • Tabular
  • Timeseries
  • Parquet

pick

Published Dec 2025 · apache-2.0 · zilch512

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 9, "total_frames": 289, "total_tasks": 2, "total_videos": 0, "total_chunks": 1, "chunks_size":…

  • 344 downloads
  • 1K<n<10K
  • Image
  • Timeseries
  • Parquet

EO-Data1.5M

Published Dec 2025 · apache-2.0 · IPEC-COMMUNITY

🤖 EO-Data-1.5M A Large-Scale Interleaved Vision-Text-Action Dataset for Embodied AI The first large-scale interleaved embodied dataset emphasizing temporal dynamics and causal dependencies among vision, language, and action modalities.

  • 4,302 downloads
  • 21 likes
  • 1M<n<10M
  • Paper available
  • Image
  • Text
  • Parquet

SAGE-3D_Collision_Mesh

Published Dec 2025 · cc-by-nc-4.0 · spatialverse

SAGE-3D Collision Mesh: Physics-Enabled Collision Bodies for 3D Gaussian Scenes Paper | Project Page | Code High-precision collision geometry dataset extracted from 1,000 indoor Mesh scenes, enabling physically accurate navigation and…

  • 2,494 downloads
  • 101 likes
  • n<1K
  • Paper available
  • Image
  • Imagefolder

SAGE-3D_InteriorGS_usdz

Published Dec 2025 · cc-by-nc-4.0 · spatialverse

SAGE-3D InteriorGS USDZ: USDZ-Format 3D Gaussian Scenes for Isaac Sim Paper | Project Page | Code InteriorGS dataset converted to USDZ format for seamless integration with NVIDIA Omniverse and Isaac Sim platforms.

  • 5,400 downloads
  • 86 likes
  • n<1K
  • Paper available
  • 3d
  • Image
  • Imagefolder

SAGE-3D_VLN_Data

Published Dec 2025 · cc-by-4.0 · spatialverse

SAGE-3D VLN Data: Vision-Language Navigation Dataset with Hierarchical Instructions Paper | Project Page | Code A comprehensive VLN dataset featuring 2 million trajectory-instruction pairs across 1,000 indoor scenes, with hierarchical…

  • 1,281 downloads
  • 69 likes
  • not specified
  • Paper available

minecraft-vla-stage1

Published Dec 2025 · mit · TESS-Computer

Minecraft VLA Stage 1: Action Pretraining Data Vision-Language-Action training data for Minecraft, processed from OpenAI's VPT contractor dataset.

  • 583 downloads
  • 3 likes
  • 10M<n<100M
  • Paper available
  • Text
  • Parquet

community_dataset_v3

Published Dec 2025 · apache-2.0 · Johnbosco20

Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets across 46 robot types, enabling…

  • 464 downloads
  • 10M<n<100M
  • Paper available

aloha_static_towel

Published Dec 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 564 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

finish_sandwich

Published Dec 2025 · apache-2.0 · izuluaga

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 80, "total_frames": 70277, "total_tasks": 2, "chunks_size": 1000,…

  • 1,299 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

2025-challenge-demos

Published Dec 2025 · mit · elonelonelon

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…

  • 331 downloads
  • 1 likes
  • 100M<n<1B
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_test

Published Dec 2025 · apache-2.0 · aractingi

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…

  • 662 downloads
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

community_dataset_v3

Published Dec 2025 · apache-2.0 · HuggingFaceVLA

Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets across 46 robot types, enabling…

  • 4,529 downloads
  • 22 likes
  • 10M<n<100M
  • Paper available

calvin_task_D_D_scale_100_lerobo_format_addmask_v3

Published Dec 2025 · apache-2.0 · ducido

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 5124, "total_frames": 308918, "total_tasks": 389, "total_videos": 0, "total_chunks": 6,…

  • 1,329 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

GATE-VLAP-datasets

Published Dec 2025 · not specified · gate-institute

GATE-VLAP Datasets Grounded Action Trajectory Embeddings with Vision-Language Action Planning This repository contains preprocessed datasets from the LIBERO benchmark suite in WebDataset TAR format, specifically designed for training…

  • 474 downloads
  • 3 likes
  • 100K<n<1M
  • Image
  • Text
  • Webdataset

xvla-soft-fold

Published Dec 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Description Repository: X-VLA License: Apache 2.0 Paper: Zheng et al., 2025, “X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model” (arXiv:2510.10274)…

  • 788 downloads
  • 7 likes
  • 1M<n<10M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-SimReady-Warehouse-01

Published Dec 2025 · cc-by-4.0 · nvidia

NVIDIA Physical AI SimReady Warehouse OpenUSD Dataset Dataset Version: 1.1.0 Date: May 18, 2025 Author: NVIDIA, Corporation License: CC-BY-4.0 (Creative Commons Attribution 4.0 International) Contents This dataset includes the following:…

  • 4,673 downloads
  • 42 likes
  • n<1K
  • Image
  • Text
  • Csv

COMMAND

Published Dec 2025 · cc-by-nc-4.0 · maum-ai

COMMAND Dataset 😃 🤗 Dataset Availability The evaluation data from this dataset is now available on Hugging Face: Repository: maum-ai/COMMAND Content: 60 evaluation scenarios (240 files, ~38GB) File Types: ROS bagfiles (.bag), metadata…

  • 1,903 downloads
  • 7 likes
  • 1K<n<10K

2025-challenge-demos

Published Dec 2025 · mit · behavior-1k

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…

  • 75,097 downloads
  • 33 likes
  • not specified
  • Paper available
  • Video

Arena-G1-Loco-Manipulation-Task

Published Dec 2025 · cc-by-4.0 · nvidia

Dataset Description: The Arena-G1-Loco-Manipulation-Task dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (G1) loco-manipulation task in IsaacLab-Arena environment.

  • 622 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

CRCD

Published Nov 2025 · cc-by-4.0 · SITL-Eng

Comprehensive Robotic Cholecystectomy Dataset (CRCD) The Comprehensive Robotic Cholecystectomy Dataset (CRCD) is a large-scale, multimodal dataset for robot-assisted surgery (RAS) research.It provides synchronized endoscopic videos, da…

  • 423 downloads
  • 3 likes
  • 100K<n<1M
  • Paper available
  • Image
  • Text
  • Parquet

navitrace

Published Nov 2025 · cc-by-4.0 · leggedrobotics

NaviTrace 🏠 Project 📄 Paper 💻 Code 🏆 Leaderboard NaviTrace is a novel VQA benchmark for VLMs that evaluates models on their embodiment-specific understanding of navigation across challenging real-world scenarios.

  • 645 downloads
  • 5 likes
  • 1K<n<10K
  • Paper available
  • Image
  • Text
  • Parquet

roboverse_data

Published Nov 2025 · apache-2.0 · RoboVerseOrg

This dataset is part of the RoboVerse project, as described in the paper RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning.

  • 50,345 downloads
  • 20 likes
  • not specified
  • Paper available

fractal-rlds

Published Nov 2025 · mit · shihao1895

Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Fractal dataset.

  • 2,232 downloads
  • not specified
  • Paper available

behavior1k

Published Nov 2025 · apache-2.0 · fracapuano

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "R1Pro", "total_episodes": 2000, "total_frames": 21227314, "total_tasks": 10, "chunks_size": 1000,…

  • 662 downloads
  • 1 likes
  • 100M<n<1B
  • Tabular
  • Timeseries
  • Parquet

berkeley-frodobots-lerobot-7k

Published Nov 2025 · apache-2.0 · frodobots

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "frodobot", "total_episodes": 185761, "total_frames": 163360777, "total_tasks": 1, "chunks_size": 1000,…

  • 2,937 downloads
  • 4 likes
  • 100M<n<1B
  • Tabular
  • Timeseries
  • Parquet

so-combined-ru

Published Nov 2025 · apache-2.0 · dunnolab

Датасет создан при помощи библиотеки LeRobot. Описание датасета Русскоязычная версия данного датасета объединяет 598 открытых датасетов сообщества в единый унифицированный корпус, включающий 22 709 эпизодов и примерно 9,4 миллиона кадров…

  • 1,227 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

so-combined-eng

Published Nov 2025 · apache-2.0 · dunnolab

This dataset was created using LeRobot. Dataset Description The English version of this dataset integrates 598 open-source community datasets into a single unified corpus, comprising 22,709 episodes and approximately 9.4 million frames…

  • 1,947 downloads
  • 6 likes
  • 1M<n<10M
  • Tabular
  • Timeseries

bridge_train_5000_10000_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_train_5000_10000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 168,737 Splits: train: 0:5000 Data Layout data_path :…

  • 350 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

bridge_train_20000_25460_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_train_20000_25460_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,460 Frames: 185,428 Splits: train: 0:5460 Data Layout data_path :…

  • 404 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

bridge_train_15000_20000_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_train_15000_20000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 169,127 Splits: train: 0:5000 Data Layout data_path :…

  • 351 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

bridge_train_10000_15000_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_train_10000_15000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,583 Splits: train: 0:5000 Data Layout data_path :…

  • 335 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

bridge_train_0_5000_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_train_0_5000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,417 Splits: train: 0:5000 Data Layout data_path :…

  • 555 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

bridge_test_0_3475_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

bridge_test_0_3475_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 3,475 Frames: 118,603 Splits: train: 0:3475 Data Layout data_path :…

  • 354 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Text
  • Parquet

community_dataset_v2

Published Nov 2025 · apache-2.0 · HuggingFaceVLA

Community Dataset v2 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 340 datasets from 117 contributors worldwide.

  • 3,216 downloads
  • 3 likes
  • n<1K
  • Paper available
  • Video

community_dataset_v1

Published Nov 2025 · apache-2.0 · HuggingFaceVLA

Community Dataset v1 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 128 datasets from 55 contributors worldwide. We used this dataset to pretrain SmolVLA.

  • 4,357 downloads
  • 5 likes
  • 10K<n<100K
  • Paper available
  • Video

pickplace_r2

Published Nov 2025 · apache-2.0 · DRMNmadhan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 75051, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 321 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

iamlab_cmu_pickup_insert_train_500_631_augmented

Published Nov 2025 · cc-by-4.0 · oxe-auge

iamlab_cmu_pickup_insert_train_500_631_augmented Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, sawyer, ur5e, widowX, xarm7 FPS: 20.0 Episodes: 131 Frames: 30,143 Videos: 1,179 Chunks: 1 Splits:…

  • 357 downloads
  • 10K<n<100K
  • Tabular
  • Text
  • Parquet

lerobot_so101_base_sim_pickplace

Published Nov 2025 · apache-2.0 · davidlinjiahao

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 5, "total_frames": 1153, "total_tasks": 1, "total_videos": 5, "total_chunks": 1,…

  • 1,385 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

libero-rlds

Published Nov 2025 · mit · shihao1895

Dataset Structure These datasets are used for MemoryVLA training. This is the standard LIBERO setting and can be directly used for other models as well.All data follow the RLDS format from the LIBERO benchmark, where each task initially…

  • 2,434 downloads
  • not specified
  • Paper available

XVLA-Soft-Fold

Published Nov 2025 · mit · Facebear

Cloth-Folding Dataset for X-VLA Paper This dataset contains 1,500 episodes of cloth folding, collected using Agilex's robotic arm.

  • 47,595 downloads
  • 10 likes
  • 1K<n<10K
  • Paper available

bridge-rlds

Published Nov 2025 · mit · shihao1895

Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Bridge dataset.

  • 4,466 downloads
  • not specified
  • Paper available

libero_90_lerobot

Published Oct 2025 · apache-2.0 · daixianjie

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 3958, "total_frames": 574553, "total_tasks": 89, "total_videos": 0, "total_chunks": 4,…

  • 1,464 downloads
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

MetaFold

Published Oct 2025 · cc-by-nc-nd-4.0 · chenhn02

Dataset Card for Dataset Name MetaFold Dataset is a point-cloud trajectory dataset designed for multi-category garment folding tasks in robotic manipulation.

  • 1,770 downloads
  • 10B<n<100B
  • Paper available

RoboBench

Published Oct 2025 · cc-by-4.0 · LeoFan01

RoboBench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain 📋 Overview RoboBench is a comprehensive evaluation benchmark designed to assess the capabilities of Multimodal Large Language Models…

  • 4,665 downloads
  • 4 likes
  • n<1K
  • Paper available
  • Image
  • Text
  • JSON

aloha_mobile_cabinet

Published Oct 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 85, "total_frames": 127500, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…

  • 3,230 downloads
  • 11 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

humanoid-everyday

Published Oct 2025 · apache-2.0 · USC-PSI-Lab

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.

  • 25,031 downloads
  • 29 likes
  • 1K<n<10K
  • Paper available
  • Video

svla_so100_stacking

Published Oct 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100", "total_episodes": 56, "total_frames": 22956, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…

  • 378 downloads
  • 3 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_train_0_100_augmented

Published Oct 2025 · cc-by-4.0 · oxe-auge

berkeley_autolab_ur5_train_0_100 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,863 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…

  • 321 downloads
  • 1K<n<10K
  • Tabular
  • Text
  • Parquet

berkeley_autolab_ur5_train_800_896_augmented

Published Oct 2025 · cc-by-4.0 · oxe-auge

berkeley_autolab_ur5_train_800_896 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 96 Frames: 9,360 Videos: 864 Chunks: 1 Splits: train: 0:96 Data…

  • 330 downloads
  • 1K<n<10K
  • Tabular
  • Text
  • Parquet

berkeley_autolab_ur5_train_400_500_augmented

Published Oct 2025 · cc-by-4.0 · oxe-auge

berkeley_autolab_ur5_train_400_500 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,479 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…

  • 369 downloads
  • 1K<n<10K
  • Tabular
  • Text
  • Parquet

berkeley_autolab_ur5_train_300_400_augmented

Published Oct 2025 · cc-by-4.0 · oxe-auge

berkeley_autolab_ur5_train_300_400 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,934 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…

  • 355 downloads
  • 1K<n<10K
  • Tabular
  • Text
  • Parquet

LIBERO-plus

Published Oct 2025 · mit · Sylvest

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website | 🤗 Assets | 🤗 Model | 📁 Training Dataset 🔥 Overview This repository contains the official implementation and benchmark for our…

  • 1,427 downloads
  • 23 likes
  • not specified
  • Paper available

libero_plus_rlds

Published Oct 2025 · mit · Sylvest

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website 🔥 Overview This repository contains the official implementation and benchmark for our paper "In-depth Robustness Analysis for…

  • 631 downloads
  • 6 likes
  • not specified
  • Paper available

thanos_picking_power_gem

Published Oct 2025 · apache-2.0 · imstevenpmwork

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 51, "total_frames": 16267, "total_tasks": 1, "chunks_size": 1000, "fps": 30,…

  • 633 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

something_something_v2_lerobot

Published Oct 2025 · apache-2.0 · JUNTAO123

🤖 Custom LeRobot Dataset This dataset was created using LeRobot,and follows the LeRobot v2.1 dataset specification for robotic control and imitation learning.

  • 342 downloads
  • 1 likes
  • not specified

OceanGym

Published Oct 2025 · mit · zjunlp

🌊 OceanGym 🦾 A Benchmark Environment for Underwater Embodied Agents 🌐 Home Page 📄 ArXiv Paper 🤗 Hugging Face ☁️ Google Drive ☁️ Baidu Drive OceanGym is a high-fidelity embodied underwater environment that simulates a realistic ocean…

  • 714 downloads
  • 2 likes
  • n<1K
  • Paper available

AgiBotWorld-Beta

Published Oct 2025 · not specified · agibot-world

Key Features 🔑 1 million+ trajectories from 100 robots, with a total duration of 2976.4 hours. 100+ real-world scenarios across 5 target domains.

  • 68,418 downloads
  • 61 likes
  • 100M<n<1B
  • Gated access
  • Text
  • Webdataset

OmniRetarget_Dataset

Published Oct 2025 · mit · omniretarget

OmniRetarget Dataset: Humanoid Loco-Manipulation & Scene Interaction Paper | Project Page This dataset contains motion trajectories of a G1 humanoid robot interacting with objects and complex terrains.

  • 1,100 downloads
  • 44 likes
  • not specified
  • Paper available

task_16_upload_1

Published Oct 2025 · mit · attilczuk

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…

  • 580 downloads
  • 1M<n<10M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

scannet-dataset

Published Oct 2025 · apache-2.0 · zahidpichen

This is ScanNet dataset containing indoor scenes which is used for 3d object detection, 3d segmentation, etc. Acknowledgement @inproceedings{dai2017scannet, title={ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes},…

  • 2,091 downloads
  • 2 likes
  • 1K<n<10K

L2D-v3

Published Oct 2025 · apache-2.0 · yaak-ai

This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.

  • 1,741 downloads
  • 4 likes
  • 1M<n<10M
  • Tabular
  • Text
  • Parquet

behavior1k-only-rgb

Published Oct 2025 · mit · k1000dai

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…

  • 709 downloads
  • 100M<n<1B
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

libero

Published Sep 2025 · apache-2.0 · HuggingFaceVLA

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": {…

  • 24,673 downloads
  • 51 likes
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

L2D

Published Sep 2025 · apache-2.0 · yaak-ai

TL;DR of L2D, the world's largest self-driving dataset! Read more about L2D on the official Huggingface blog: LeRobot goes to driving school 90+ TeraBytes of multimodal data (5000+ hours of driving) from 30 cities in Germany 6x surrounding…

  • 8,213 downloads
  • 45 likes
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

AgiBotWorld-Alpha

Published Sep 2025 · not specified · agibot-world

⚠️Important Notice !!! Dear Users, The Alpha Dataset has been updated as follows: Frame Loss Data Removal: Several episodes with frame loss issues have been removed.

  • 12,097 downloads
  • 220 likes
  • 10M<n<100M
  • Gated access
  • Text
  • Webdataset

MesaTask-10K

Published Sep 2025 · not specified · InternRobotics

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning 🔑Key Features MesaTask-10K, a large-scale dataset for task-oriented tabletop scene generation, comprises approximately 10,700 synthetic tabletop scenes…

  • 1,842 downloads
  • 15 likes
  • 10K<n<100K
  • Paper available
  • Gated access

svla_so100_pickplace

Published Sep 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 50, "total_frames": 19631, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…

  • 1,321 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

taco_play

Published Sep 2025 · cc-by-4.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3603, "total_frames": 237798, "total_tasks": 406, "total_videos": 7206, "total_chunks": 4,…

  • 1,415 downloads
  • 2 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

stanford_kuka_multimodal_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3000, "total_frames": 149985, "total_tasks": 1, "total_videos": 3000, "total_chunks": 3,…

  • 1,219 downloads
  • 2 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

droid_100

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 32212, "total_tasks": 47, "total_videos": 300, "total_chunks": 1,…

  • 2,575 downloads
  • 9 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

berkeley_rpt

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_videos": 908, "total_chunks": 1,…

  • 1,417 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

toto

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1003, "total_frames": 325699, "total_tasks":1, "total_videos": 1003, "total_chunks": 2,…

  • 583 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

stanford_robocook

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 2460, "total_frames": 112980, "total_tasks": 9, "total_videos": 9840, "total_chunks": 3,…

  • 1,131 downloads
  • 3 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

berkeley_mvp

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_videos": 480, "total_chunks": 1,…

  • 805 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

stanford_hydra_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_videos": 1140, "total_chunks": 1,…

  • 949 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

utaustin_mutex

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "total_videos": 3000, "total_chunks": 2,…

  • 668 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5

Published Sep 2025 · cc-by-4.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 97939, "total_tasks": 5, "total_videos": 3000, "total_chunks": 1,…

  • 1,587 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

austin_sailor_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_videos": 480, "total_chunks": 1,…

  • 378 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

austin_sirius_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_videos": 1118, "total_chunks": 1,…

  • 1,301 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

nyu_door_opening_surprising_effectiveness

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 484, "total_frames": 20405, "total_tasks": 1, "total_videos": 484, "total_chunks": 1,…

  • 947 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

jaco_play

Published Sep 2025 · cc-by-4.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1085, "total_frames": 77965, "total_tasks": 89, "total_videos": 2170, "total_chunks": 2,…

  • 1,541 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

kaist_nonprehensile

Published Sep 2025 · cc-by-4.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 201, "total_frames": 32429, "total_tasks": 193, "total_videos": 201, "total_chunks": 1,…

  • 529 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_fanuc_manipulation

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total_videos": 830, "total_chunks": 1,…

  • 649 downloads
  • 1 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

cmu_play_fusion

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total_videos": 576, "total_chunks": 1,…

  • 985 downloads
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

columbia_cairlab_pusht_real

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 136, "total_frames": 27808, "total_tasks": 1, "total_videos": 272, "total_chunks": 1,…

  • 530 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

cmu_franka_exploration_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 199, "total_frames": 1990, "total_tasks": 3, "total_videos": 398, "total_chunks": 1,…

  • 610 downloads
  • 2 likes
  • 1K<n<10K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

utokyo_xarm_bimanual

Published Sep 2025 · cc-by-4.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 70, "total_frames": 1514, "total_tasks": 2, "total_videos": 70, "total_chunks": 1,…

  • 382 downloads
  • 3 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

nyu_franka_play_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 456, "total_frames": 44875, "total_tasks": 1, "total_videos": 912, "total_chunks": 1,…

  • 1,109 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

ucsd_pick_and_place_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1355, "total_frames": 67750, "total_tasks": 3, "total_videos": 1355, "total_chunks": 2,…

  • 726 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

dlr_sara_pour

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 12971, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…

  • 320 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

dlr_edan_shared_control

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 104, "total_frames": 8928, "total_tasks": 14, "total_videos": 104, "total_chunks": 1,…

  • 320 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

dlr_sara_grid_clamp

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 107, "total_frames": 7622, "total_tasks": 1, "total_videos": 107, "total_chunks": 1,…

  • 339 downloads
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

ucsd_kitchen_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 150, "total_frames": 3970, "total_tasks": 8, "total_videos": 150, "total_chunks": 1,…

  • 668 downloads
  • 2 likes
  • 1K<n<10K
  • Tabular
  • Timeseries
  • Parquet

austin_buds_dataset

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 34112, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…

  • 407 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

utokyo_pr2_tabletop_manipulation

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 32708, "total_tasks": 3, "total_videos": 240, "total_chunks": 1,…

  • 633 downloads
  • 5 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

cmu_stretch

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_videos": 135, "total_chunks": 1,…

  • 707 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

asu_table_top

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 110, "total_frames": 26113, "total_tasks": 216, "total_videos": 110, "total_chunks": 1,…

  • 347 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

utokyo_pr2_opening_fridge

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 80, "total_frames": 11522, "total_tasks": 1, "total_videos": 80, "total_chunks": 1,…

  • 387 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_chair

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 55, "total_frames": 110000, "total_tasks": 1, "total_videos": 165, "total_chunks": 1,…

  • 499 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_wash_pan

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_videos": 150, "total_chunks": 1,…

  • 486 downloads
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_pro_pencil

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 25, "total_frames": 8750, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…

  • 496 downloads
  • 1 likes
  • 1K<n<10K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_pingpong_test

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 10, "total_frames": 6000, "total_tasks": 1, "total_videos": 40, "total_chunks": 1,…

  • 402 downloads
  • 2 likes
  • 1K<n<10K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_cups_open

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 3,443 downloads
  • 11 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_coffee_new

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 75000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 382 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_vinh_cup_left

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 100, "total_frames": 50000, "total_tasks": 1, "total_videos": 400, "total_chunks": 1,…

  • 562 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_ziploc_slide

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 56, "total_frames": 16800, "total_tasks": 1, "total_videos": 224, "total_chunks": 1,…

  • 497 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_vinh_cup

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 101, "total_frames": 45500, "total_tasks": 1, "total_videos": 404, "total_chunks": 1,…

  • 616 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_coffee

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 886 downloads
  • 6 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_wipe_wine

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 65000, "total_tasks": 1, "total_videos": 150, "total_chunks": 1,…

  • 433 downloads
  • 3 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_candy

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 695 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

aloha_static_screw_driver

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 564 downloads
  • 3 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

xarm_push_medium_replay

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…

  • 4,122 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

xarm_push_medium

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…

  • 3,669 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

xarm_lift_medium_replay

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…

  • 3,608 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

xarm_lift_medium

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…

  • 12,281 downloads
  • 2 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

pusht

Published Sep 2025 · mit · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks":1, "total_videos": 206, "total_chunks": 1,…

  • 23,852 downloads
  • 46 likes
  • 10K<n<100K
  • Paper available
  • Tabular
  • Timeseries
  • Parquet

svla_so101_pickplace

Published Sep 2025 · apache-2.0 · lerobot

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100_follower", "total_episodes": 50, "total_frames": 11939, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…

  • 1,761 downloads
  • 32 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_MountCameraRedGripper_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Gripper", "total_episodes": 201, "total_frames": 172649, "total_tasks": 1, "total_videos": 804,…

  • 796 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot_pathmask

Published Sep 2025 · apache-2.0 · jesbu1

PEEK VLM-Labeled BRIDGE_v2 dataset This dataset contains the LeRobot-format BRIDGE-v2 dataset with paths and masks from the PEEK VLM drawn onto the image: PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of…

  • 3,958 downloads
  • 3 likes
  • not specified
  • Paper available
  • Video

G1_Dex3_ObjectPlacement_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 491 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_GraspSquare_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 410 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_CameraPackaging_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 320 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_ToastedBread_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 2,059 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_BlockStacking_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 679 downloads
  • 4 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Z1_StackBox_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Single", "total_episodes": 1265, "total_frames": 989335, "total_tasks": 1, "total_videos": 2530,…

  • 457 downloads
  • 5 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Z1_Dual_Dex1_StackBox_Dataset_V2

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Dual", "total_episodes": 254, "total_frames": 178104, "total_tasks": 1, "total_videos": 762, "total_chunks":…

  • 638 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_MountCamera_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…

  • 381 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_GraspRubiksCube_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 197, "total_frames": 220788, "total_tasks": 1, "total_videos": 788,…

  • 333 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_GraspOreo_Dataset

Published Sep 2025 · apache-2.0 · unitreerobotics

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 201, "total_frames": 234959, "total_tasks": 1, "total_videos": 804,…

  • 496 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

SocialNav-SUB

Published Sep 2025 · mit · michaelmunje

SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation This is the accompying dataset for the Social Navigation Scene Understanding Benchmark (SocialNav-SUB) which is a Visual Question Answering (VQA) dataset…

  • 2,197 downloads
  • n<1K
  • Paper available
  • Image
  • Text
  • Csv

VLA_Instruction_Tuning

Published Sep 2025 · not specified · ShuaiYang03

This repository contains the VLA-IT dataset, a curated 650K-sample Vision-Language-Action Instruction Tuning dataset, and the SimplerEnv-Instruct benchmark.

  • 1,095 downloads
  • 4 likes
  • not specified
  • Paper available

MolmoAct-Pretraining-Mixture

Published Sep 2025 · cc-by-4.0 · allenai

MolmoAct - Pretraining Mixture Data Mixture used for MolmoAct Pretraining. Contains a subset of OXE formulated as Action Reasoning Data along with auxiliary robot data and link to Multimodal Web data.

  • 5,929 downloads
  • 10 likes
  • 10M<n<100M
  • Paper available
  • Image
  • Text
  • Parquet

so101-sort-color-joint-angles-preprocessed

Published Sep 2025 · apache-2.0 · MarkusWuenstel

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 2, "total_frames": 3394, "total_tasks": 1, "total_videos": 6, "total_chunks": 1, "chunks_size":…

  • 887 downloads
  • 10K<n<100K
  • Image
  • Tabular
  • Parquet

Griffin

Published Sep 2025 · mit · wjh-svm

Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark 📄 arXiv | 🐙 GitHub | 💾 Baidu Netdisk | 🤗 Hugging Face Griffin is the pioneering publicly available dataset for aerial-ground cooperative 3D perception.

  • 647 downloads
  • 2 likes
  • 100K<n<1M
  • Paper available

MolmoAct-Dataset

Published Sep 2025 · cc-by-4.0 · allenai

This dataset was created using LeRobot. Dataset Description This dataset contains MolmoAct Dataset in lerobot format. All contents in this dataset were collected in-house by Ai2.

  • 6,845 downloads
  • 27 likes
  • 1M<n<10M
  • Paper available
  • Image
  • Timeseries
  • Parquet

vla_demo

Published Sep 2025 · apache-2.0 · IliaLarchenko

This dataset is used for the demonstration of the Vision Language Action model finetuning. It is collected using a modified version of LeKiwi with 3 cameras, but technically, only the arm is used, so it can be treated as a dataset for…

  • 396 downloads
  • 8 likes
  • n<1K
  • Video

data

Published Aug 2025 · mit · behavior-robot-suite

Dataset Card for BEHAVIOR Robot Suite (BRS) Data This dataset provides robotic trajectories for five real-world household tasks.

  • 767 downloads
  • 9 likes
  • not specified
  • Paper available

MolmoAct-Midtraining-Mixture

Published Aug 2025 · cc-by-4.0 · allenai

MolmoAct - Midtraining Mixture Data Mixture used for MolmoAct Midtraining. Contains MolmoAct Dataset formulated as Action Reasoning Data.

  • 1,858 downloads
  • 4 likes
  • 1M<n<10M
  • Paper available
  • Image
  • Text
  • Parquet

human2locoman

Published Aug 2025 · mit · chrisyrniu

Human2LocoMan: Learning Versatile Quadrupedal Manipulation with Human Pretraining Yaru Niu1,* Yunzhe Zhang1,* Mingyang Yu1 Changyi Lin1 Chenhao Li1 Yikai Wang1 Yuxiang Yang2 Wenhao Yu2 Tingnan Zhang2 Zhenzhen Li3 Jonathan Francis1,3…

  • 426 downloads
  • 2 likes
  • n<1K
  • Paper available
  • Video

GraspClutter6D

Published Aug 2025 · cc-by-sa-4.0 · GraspClutter6D

GraspClutter6D Dataset GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Raeyoung Kang, Sangbeom Lee, Sangjun Noh,…

  • 900 downloads
  • 5 likes
  • n>1T
  • Paper available

rbo_oxe_base_language_table_lerobot

Published Aug 2025 · apache-2.0 · saaduddinM

Language Table (LeRobot) — Task-Pruned, Reindexed Subset This release is a task-pruned subset of the original IPEC-COMMUNITY/language_table_lerobot.

  • 407 downloads
  • not specified

aloha_pen_uncap

Published Aug 2025 · apache-2.0 · Voxel51

Dataset Card for aloha_pen_uncap This dataset is a FiftyOne conversion in LeRobot format of the aloha_pen_uncap_diverse subset of BiPlay.

  • 2,951 downloads
  • 1 likes
  • 100K<n<1M
  • Paper available
  • Image

Amelia42-Mini

Published Aug 2025 · bsd-2-clause · AmeliaCMU

Dataset Overview The Amelia42-Mini dataset provides air traffic position reports for 42 major U.S. airports, including the following airports: KATL (Hartsfield-Jackson Atlanta International Airport) KBDL (Bradley International Airport)…

  • 2,011 downloads
  • 4 likes
  • 1B<n<10B
  • Tabular
  • Text
  • Csv

h0_post_train_db_2508

Published Aug 2025 · mit · BeingBeyond

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos We introduce Being-H0, the first dexterous Vision-Language-Action model pretrained from large-scale human videos via explicit hand motion modeling.

  • 357 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Text
  • JSON

leisaac-pick-orange

Published Jul 2025 · apache-2.0 · LightwheelAI

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 60, "total_frames": 36293, "total_tasks": 1, "total_videos": 120, "total_chunks": 1,…

  • 468 downloads
  • 4 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

merged_libero_mask_depth_noops_lerobot_10

Published Jul 2025 · apache-2.0 · binhng

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 200, "total_frames": 33587, "total_tasks": 40, "total_videos": 1600, "total_chunks": 1,…

  • 363 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

ROVER

Published Jul 2025 · mit · iis-esslingen

The ROVER Visual SLAM Benchmark Paper News [2024/12/05] Initial code release. [2025/05/20] ROVER is accepted to IEEE Transactions on Robotics.

  • 784 downloads
  • 4 likes
  • 100M<n<1B
  • Paper available

insert-blender

Published Jul 2025 · mit · autobio-bench

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 100, "total_frames": 55127, "total_tasks": 10, "total_videos": 200, "total_chunks": 1,…

  • 507 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

thermal_cycler_open-mujoco

Published Jul 2025 · mit · autobio-bench

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 84994, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 333 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

thermal_cycler_close-mujoco

Published Jul 2025 · mit · autobio-bench

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 106292, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…

  • 400 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

ManiSkill_Demonstrations

Published Jul 2025 · apache-2.0 · haosulab

ManiSkill Demonstrations This dataset repo contains all of the latest ManiSkill demonstration datasets as well as some pretained model weights used to generate some demonstrations.

  • 2,644 downloads
  • 7 likes
  • 1M<n<10M

RoboSense

Published Jul 2025 · cc-by-sa-4.0 · suhaisheng0527

Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments Description RoboSense is a large-scale multimodal dataset constructed to facilitate egocentric robot perception…

  • 2,075 downloads
  • 7 likes
  • not specified
  • Paper available

humanoidgen_dataset

Published Jul 2025 · not specified · TeleEmbodied

HumanoidGen-Dataset 🔥 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repository provides datasets and resources for the HumanoidGen framework, enabling bimanual dexterous manipulation research.

  • 409 downloads
  • 2 likes
  • not specified
  • Paper available

libero_90_no_noops_lerobot

Published Jul 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 3921, "total_frames": 569249, "total_tasks": 73, "total_videos": 7842, "total_chunks": 3,…

  • 3,630 downloads
  • 2 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-GraspGen

Published Jun 2025 · cc-by-4.0 · nvidia

GraspGen: Scaling Sim2Real Grasping GraspGen is a large-scale simulated grasp dataset for multiple robot embodiments and grippers.

  • 1,056 downloads
  • 30 likes
  • 10K<n<100K
  • Text
  • Text

so101_multi_task_v2

Published Jun 2025 · apache-2.0 · fbeltrao

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 59, "total_frames": 20844, "total_tasks": 2, "total_videos": 118, "total_chunks": 1,…

  • 324 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

Published Jun 2025 · cc-by-4.0 · nvidia

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.

  • 28,902 downloads
  • 49 likes
  • n>1T
  • Paper available

so101_unplug_cable_4

Published Jun 2025 · apache-2.0 · fbeltrao

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 10, "total_frames": 4539, "total_tasks": 1, "total_videos": 20, "total_chunks": 1,…

  • 478 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

so101_unplug_cable_3

Published Jun 2025 · not specified · fbeltrao

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 49, "total_frames": 23490, "total_tasks": 1, "total_videos": 98, "total_chunks": 1,…

  • 383 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

so101-table-cleanup

Published Jun 2025 · apache-2.0 · youliangtan

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 1, "total_frames": 1358, "total_tasks": 1, "total_videos": 2, "total_chunks": 1,…

  • 850 downloads
  • 32 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-GR00T-Tuned-Tasks

Published Jun 2025 · cc-by-4.0 · nvidia

Dataset Description: This dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (GR1) tabletop manipulation tasks for industrial settings.

  • 335 downloads
  • 3 likes
  • 100K<n<1M
  • Tabular
  • Video
  • Parquet

MS-HAB-SetTable

Published Jun 2025 · mit · arth-shukla

ManiSkill-HAB SetTable Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB SetTable.

  • 408 downloads
  • 1M<n<10M
  • Paper available

MS-HAB-PrepareGroceries

Published Jun 2025 · mit · arth-shukla

ManiSkill-HAB PrepareGroceries Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB PrepareGroceries.

  • 334 downloads
  • 1M<n<10M
  • Paper available

MS-HAB-TidyHouse

Published Jun 2025 · mit · arth-shukla

ManiSkill-HAB TidyHouse Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB TidyHouse.

  • 875 downloads
  • 3 likes
  • 1M<n<10M
  • Paper available

bridge_v2_lerobot

Published Jun 2025 · apache-2.0 · jesbu1

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 15,331 downloads
  • 1 likes
  • not specified
  • Video

av_aloha_sim_cube_transfer

Published May 2025 · apache-2.0 · iantc104

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 200, "total_frames": 30000, "total_tasks": 1, "total_videos": 1200, "total_chunks": 1,…

  • 462 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

onetwovla-dataset

Published May 2025 · not specified · Richard-Nai

Datasets for OneTwoVLA [Project Page] | [Paper] | [Code] This repository provides datasets collected with the UMI, converted into the LeRobot data format, along with synthetic vision-language data used in the paper OneTwoVLA: A Unified…

  • 655 downloads
  • 6 likes
  • 100K<n<1M
  • Paper available
  • Image
  • Timeseries
  • Parquet

MoveBlock

Published May 2025 · apache-2.0 · DAbraham

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 2, "total_frames": 1731, "total_tasks":1, "total_videos": 6, "total_chunks": 1, "chunks_size":…

  • 386 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

agibot_alpha_v30

Published May 2025 · apache-2.0 · cadene

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AgiBot_A2D", "total_episodes": 28122, "total_frames": 47613574, "total_tasks": 30, "chunks_size": 1000,…

  • 2,999 downloads
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

PhysicalAI-Robotics-Manipulation-Objects

Published May 2025 · not specified · nvidia

PhysicalAI-Robotics-Manipulation-Objects is a dataset of automatic generated motions of robots performing operations such as picking and placing objects in a kitchen environment.

  • 668 downloads
  • 8 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-Manipulation-SingleArm

Published May 2025 · cc-by-4.0 · nvidia

Dataset Description: PhysicalAI-Robotics-Manipulation-SingeArm is a collection of datasets of automatic generated motions of a Franka Panda robot performing operations such as block stacking, opening cabinets and drawers.

  • 5,705 downloads
  • 19 likes
  • not specified

PhysicalAI-Robotics-Manipulation-Kitchen

Published May 2025 · cc-by-4.0 · nvidia

PhysicalAI Robotics Manipulation in the Kitchen Dataset Description: PhysicalAI-Robotics-Manipulation-Kitchen is a dataset of automatic generated motions of robots performing operations such as opening and closing cabinets, drawers,…

  • 1,477 downloads
  • 14 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

bridgev2

Published May 2025 · apache-2.0 · Qu3tzal

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 21,791 downloads
  • 2 likes
  • not specified
  • Video

EMMOE-100

Published May 2025 · apache-2.0 · Dongping-Li

EMMOE-100 Trainset Resources Project Paper Code Model Dataset Dataset Feature Task Attributes Task Example Dataset Structure EMMOE-100/ ├── README.md ├── assets/ ├── data/ │ └── train/ │ ├── 1/ │ │ ├── info.txt │ │ ├── info_re1.txt │ │ ├──…

  • 397 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Text
  • JSON

droid_1.0.1_v30_compact

Published May 2025 · apache-2.0 · cadene

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596, "chunks_size": 1000,…

  • 443 downloads
  • 1M<n<10M
  • Tabular
  • Text
  • Parquet

fmb_dataset_lerobot

Published Apr 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 8612, "total_frames": 1137459, "total_tasks": 24, "total_videos": 34448, "total_chunks": 9,…

  • 6,440 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

hamburger_heti

Published Apr 2025 · apache-2.0 · HuaihaiLyu

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 598, "total_frames": 285571, "total_tasks": 9, "total_videos": 1794, "total_chunks": 1,…

  • 472 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

droid_lerobot

Published Apr 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 92233, "total_frames": 27044326, "total_tasks": 31308, "total_videos": 276699,…

  • 299,775 downloads
  • 12 likes
  • not specified

droid_1.0.1_v30

Published Apr 2025 · apache-2.0 · cadene

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596, "chunks_size": 1000,…

  • 4,442 downloads
  • 4 likes
  • 10M<n<100M
  • Tabular
  • Text
  • Parquet

agent-reward-bench

Published Apr 2025 · not specified · McGill-NLP

AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Meade, Arkil Patel, Dongchan Shin,…

  • 2,747 downloads
  • 4 likes
  • 1K<n<10K
  • Paper available
  • Image
  • Text
  • Csv

libero-10-lerobot

Published Apr 2025 · apache-2.0 · Felix-Zhenghao

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101499, "total_tasks": 10, "total_videos": 0, "total_chunks": 1,…

  • 1,539 downloads
  • 3 likes
  • 100K<n<1M
  • Image
  • Timeseries
  • Parquet

hamburger

Published Apr 2025 · apache-2.0 · HuaihaiLyu

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 390, "total_frames": 219625, "total_tasks": 19, "total_videos": 1170, "total_chunks": 1,…

  • 431 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

Retargeted_AMASS_for_bxi_elf2

Published Apr 2025 · cc-by-4.0 · fleaven

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…

  • 3,906 downloads
  • 10K<n<100K

Retargeted_AMASS_for_FourierN1

Published Apr 2025 · cc-by-4.0 · fleaven

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…

  • 2,174 downloads
  • 10K<n<100K

Retargeted_AMASS_for_robotics

Published Apr 2025 · cc-by-4.0 · fleaven

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…

  • 4,151 downloads
  • 33 likes
  • 10K<n<100K

PH2D

Published Apr 2025 · mit · RogerQi

Dataset Description This dataset contains egocentric human-humanoid data that can be used to co-train manipulation policy for humanoid robot.

  • 5,532 downloads
  • 10 likes
  • not specified
  • Paper available

libero_spatial_no_noops_lerobot_v21

Published Apr 2025 · apache-2.0 · aopolin-lv

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "total_videos": 864, "total_chunks": 1,…

  • 698 downloads
  • 2 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

reactive_diffusion_policy_dataset

Published Apr 2025 · mit · WendiChen

Dataset of Reactive Diffusion Policy Contents Description Structure Usage Tactile Dataset Description This is the raw and postprocessed dataset used in the paper Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for…

  • 561 downloads
  • 2 likes
  • n<1K
  • Paper available
  • Video

AMASS_Retargeted_for_G1

Published Mar 2025 · cc-by-4.0 · ember-lab-berkeley

Retargeted AMASS Dataset for G1 This repository contains 100% of the AMASS dataset retargeted to the G1 humanoid and formatted for use with IsaacLab's AMP motion loader. It should also be compatible with ProtoMotions.

  • 3,339 downloads
  • 20 likes
  • 10K<n<100K

Maniskill-Pushcube-demonstration-large

Published Mar 2025 · apache-2.0 · AdilZtn

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 1860, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0, "chunks_size":…

  • 351 downloads
  • 10K<n<100K
  • Image
  • Timeseries
  • Parquet

droid_1.0.1

Published Mar 2025 · apache-2.0 · cadene

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Franka", "total_episodes": 95600, "total_frames": 27612581, "total_tasks": 0, "total_videos": 286800, "total_chunks":…

  • 304,654 downloads
  • 31 likes
  • not specified

language_table_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605, "total_videos": 442226, "total_chunks":…

  • 165,979 downloads
  • 1 likes
  • not specified

bc_z_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 104, "total_videos": 39350,…

  • 3,535 downloads
  • 1 likes
  • not specified
  • Video

Open X-Embodiment Dataset: 1M+ Robot Trajectories Across 22 Embodiments

Published Mar 2025 · Dataset profile

Open X-Embodiment is a 1M+ trajectory dataset spanning 22 robot embodiments—from single arms to bi-manual systems and quadrupeds—released in October 2023 to train cross-embodiment policies like RT-X.

berkeley_fanuc_manipulation_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "fanuc_mate", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total_videos": 830, "total_chunks": 1,…

  • 474 downloads
  • 1 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

dexmimicgen_datasets

Published Mar 2025 · cc-by-nc-sa-4.0 · MimicGen

DexMimicGen Datasets This repository contains the official dataset release of simulation environments and datasets for the ICRA 2025 paper "DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning".

  • 420 downloads
  • 1 likes
  • n<1K
  • Paper available
  • Image
  • Imagefolder

ManiSkill2

Published Mar 2025 · apache-2.0 · haosulab

ManiSkill2 Data Update: ManiSkill 3 has been released https://github.com/haosulab/ManiSkill/. It uses different datasets than ManiSkill2 so the data here is not expected to transfer over ManiSkill2 is a unified benchmark for learning…

  • 894 downloads
  • 11 likes
  • 1M<n<10M
  • Paper available

roboturk_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "sawyer", "total_episodes": 1796, "total_frames": 168423, "total_tasks": 3, "total_videos": 1796, "total_chunks": 2,…

  • 1,747 downloads
  • 1 likes
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

utaustin_mutex_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "total_videos": 3000, "total_chunks": 2,…

  • 1,489 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

viola_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 135, "total_frames": 68913, "total_tasks": 3, "total_videos": 270, "total_chunks": 1,…

  • 763 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

taco_play_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "total_videos": 6484, "total_chunks": 4,…

  • 2,377 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

iamlab_cmu_pickup_insert_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 631, "total_frames": 146241, "total_tasks": 7, "total_videos": 1262, "total_chunks": 1,…

  • 1,192 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

toto_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 902, "total_frames": 294139, "total_tasks": 1, "total_videos": 902, "total_chunks": 1,…

  • 1,263 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

stanford_hydra_dataset_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_videos": 1140, "total_chunks": 1,…

  • 1,028 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

nyu_franka_play_dataset_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 365, "total_frames": 34448, "total_tasks": 1, "total_videos": 730, "total_chunks": 1,…

  • 622 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

nyu_door_opening_surprising_effectiveness_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 435, "total_frames": 18196, "total_tasks": 1, "total_videos": 435, "total_chunks": 1,…

  • 528 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

jaco_play_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "jaco_2", "total_episodes": 976, "total_frames": 70127, "total_tasks": 88, "total_videos": 1952, "total_chunks": 1,…

  • 1,335 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

cmu_play_fusion_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total_videos": 576, "total_chunks": 1,…

  • 980 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

cmu_stretch_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_videos": 135, "total_chunks": 1,…

  • 370 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_mvp_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_videos": 480, "total_chunks": 1,…

  • 531 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_cable_routing_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1482, "total_frames": 38240, "total_tasks": 1, "total_videos": 5928, "total_chunks": 2,…

  • 1,046 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5", "total_episodes": 896, "total_frames": 87783, "total_tasks": 5, "total_videos": 1792, "total_chunks": 1,…

  • 2,118 downloads
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

austin_sirius_dataset_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_videos": 1118, "total_chunks": 1,…

  • 1,040 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

austin_sailor_dataset_lerobot

Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_videos": 480, "total_chunks": 1,…

  • 803 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

colosseum-challenge

Published Mar 2025 · mit · colosseum

Colosseum Dataset Card This dataset contains demonstrations for training and testing Imitation Learning based policies, taken from our simulation benchmark Colosseum, which is based on RLBench.

  • 739 downloads
  • 2 likes
  • not specified
  • Paper available

droid

Published Feb 2025 · apache-2.0 · cadene

This dataset was created using LeRobot. DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset One of the biggest open-source dataset for robotics with 27.044,326 frames, 92,223 episodes, 31,308 unique task description in natural…

  • 346,008 downloads
  • 13 likes
  • 10M<n<100M
  • Paper available
  • Video

kuka_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "kuka_iiwa", "total_episodes": 209880, "total_frames": 2455879, "total_tasks": 1, "total_videos": 209880, "total_chunks":…

  • 46,810 downloads
  • not specified
  • Video

fractal20220817_data_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 599, "total_videos": 87212,…

  • 96,222 downloads
  • 12 likes
  • not specified
  • Video

bridge_orig_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…

  • 151,758 downloads
  • 15 likes
  • not specified
  • Video

berkeley_rpt_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_videos": 908, "total_chunks": 1,…

  • 913 downloads
  • 100K<n<1M
  • Tabular
  • Timeseries
  • Parquet

furniture_bench_dataset_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "total_videos": 10200, "total_chunks": 6,…

  • 2,692 downloads
  • 1M<n<10M
  • Tabular
  • Timeseries

dobbe_lerobot

Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 5208, "total_frames": 1139911, "total_tasks": 7, "total_videos": 5208, "total_chunks":…

  • 2,275 downloads
  • 1 likes
  • 1M<n<10M
  • Tabular
  • Timeseries
  • Parquet

aloha_pen_uncap_diverse

Published Feb 2025 · apache-2.0 · physical-intelligence

This dataset was created using LeRobot. Dataset Description This dataset is a lerobot conversion of the aloha_pen_uncap_diverse subset of BiPlay.

  • 1,059 downloads
  • 7 likes
  • 10K<n<100K
  • Paper available
  • Image
  • Timeseries
  • Parquet

GEMBench

Published Feb 2025 · mit · rjgpinel

Dataset Card for GEMBench dataset 💎 GEneralizable vision-language robotic Manipulation Benchmark Dataset A benchmark to systematically evaluate generalization capabilities of vision-and-language robotic manipulation policies.

  • 596 downloads
  • 3 likes
  • 10K<n<100K
  • Paper available

libero

Published Feb 2025 · cc-by-4.0 · physical-intelligence

This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10.

  • 40,682 downloads
  • 78 likes
  • 100K<n<1M
  • Paper available
  • Image
  • Timeseries
  • Parquet

FuSe

Published Jan 2025 · mit · oier-mees

The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments, annotated with natural language.

  • 1,273 downloads
  • 4 likes
  • not specified
  • Paper available

BiPlay

Published Jan 2025 · mit · oier-mees

BiPlay contains 9.7 hours of bimanual data collected with an aloha robot at the RAIL lab @ UC Berkeley, USA. It contains 7023 clips, 2000 language annotations and 326 unique scenes.

  • 1,563 downloads
  • 15 likes
  • not specified
  • Paper available

Humanoid-X

Published Jan 2025 · not specified · USC-PSI-Lab

Humanoid-X 🌐 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repo contains the officail dataset for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control" If you like our project, please give us a…

  • 536 downloads
  • 14 likes
  • 100K<n<1M
  • Paper available
  • Text
  • Text

koch_test

Published Jan 2025 · apache-2.0 · danaaubakirova

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "koch", "total_episodes": 51, "total_frames": 16602, "total_tasks": 1, "total_videos": 102, "total_chunks": 1,…

  • 1,255 downloads
  • 7 likes
  • 10K<n<100K
  • Tabular
  • Timeseries
  • Parquet

GoPro-Raw-Videos

Published Nov 2024 · mit · Fanqi-Lin

Raw GoPro Videos for Four Robotic Manipulation Tasks [Project Page] [Paper] [Code] [Models] [Processed Dataset] This repository contains raw GoPro videos of robotic manipulation tasks collected in-the-wild using UMI, as described in the…

  • 948 downloads
  • 1 likes
  • 10K<n<100K
  • Paper available
  • Video

Processed-Task-Dataset

Published Nov 2024 · not specified · Fanqi-Lin

Robotic Manipulation Datasets for Four Tasks [Project Page] [Paper] [Code] [Models] [Raw GoPro Videos] This repository contains in-the-wild robotic manipulation datasets collected using UMI, and processed through a SLAM pipeline, as…

  • 441 downloads
  • 7 likes
  • 100B<n<1T
  • Paper available

OpenX-Embodiment

Published Oct 2024 · cc-by-4.0 · jxu124

Open X-Embodiment Dataset (unofficial) This is an unofficial Dataset Repo. This Repo is set up to make Open X-Embodiment Dataset (55 in 1) more accessible for people who love huggingface🤗.

  • 19,757 downloads
  • 102 likes
  • 1M<n<10M

koch_bimanual_folding

Published Aug 2024 · not specified · cadene

This dataset was created using 🤗 LeRobot.

  • 502 downloads
  • 2 likes
  • not specified

functional-manipulation-benchmark

Published Jan 2024 · cc-by-4.0 · charlesxu0124

Functional Manipulation Benchmark This robot learning dataset is a part of the paper "FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning".

  • 654 downloads
  • 5 likes
  • not specified
  • Paper available

608 remaining

BROWSE BY MODALITY

BROWSE BY TASK

BROWSE BY FORMAT

BROWSE BY LICENSE

TRUELABEL ROUTING

Public data rarely closes the whole gap.

If the directory surfaces a missing modality, geography, license, or deployment environment, tell us what you need and our team will route the request.

Request custom data