truelabel

ROBOTICS DATASET INDEX

Hugging Face robotics datasets

A broad source-backed watchlist for physical AI teams comparing robotics datasets, licenses, formats, modalities, and ingestion risk across the Hugging Face Hub.

DIRECT ANSWER

This index tracks 1,000 Hugging Face records tagged for robotics. It is the breadth layer for truelabel: use it to discover candidates, then promote important records into deeper dataset profiles, comparisons, and custom collection specs.

Current crawl

Indexed records

1,000 robotics-tagged dataset records.

Hub signal

Observed downloads

5,343,188 combined downloads across the indexed records.

Refresh timestamp

Generated at

May 1, 2026, 9:28 PM

BREADTH MAP

What the Hugging Face robotics surface looks like

Dataset owners

Top authors

  • RoboCOIN: 326
  • lerobot: 99
  • unitreerobotics: 37
  • IPEC-COMMUNITY: 30
  • lerobot-data-collection: 14
  • nvidia: 13
  • oxe-auge: 11
  • Factory-Intelligence: 10
  • InternRobotics: 10
  • ByteRainx: 9
  • Mimic-Robotics: 8
  • villekuosmanen: 7

Terms signal

Top licenses

  • Apache 2.0: 691
  • Mit: 133
  • Cc By 4.0: 74
  • Not Specified: 39
  • Cc By Nc Sa 4.0: 18
  • Cc By Nc 4.0: 17
  • Other: 9
  • Cc By Sa 4.0: 8

Data shape

Top modalities

  • Video: 582
  • Timeseries: 562
  • Tabular: 534
  • Image: 113
  • Text: 87
  • 3d: 11
  • Audio: 5
  • Document: 5

616 INDEXABLE RECORDS

Filter and search the Hugging Face robotics watchlist

Tier-A and Tier-B records only. The remaining 384 records are consolidated under author clusters.

License
Modality
Format

616 of 616 datasets

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

350,718 downloads · cc-by-4.0

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robo

droid

346,008 downloads · apache-2.0

This dataset was created using LeRobot. DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset One of the biggest open-source dataset for robotics with 27.044,326 frames, 92,223 episodes, 31,308

  • Video

droid_1.0.1

304,654 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Franka", "total_episodes": 95600, "total_frames": 27612581, "total_tasks": 0, "to

droid_lerobot

299,775 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 92233, "total_frames": 27044326, "total_tasks": 31308,

OmniWorld

200,677 downloads · cc-by-nc-sa-4.0

[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling 🎉NEWS [2026.3.21] 🔥 OmniWorld-Game with Metric Scale is now released! Check out our latest model Pi3X (an enhanced

  • Image
  • Text
  • Webdataset

10Kh-RealOmin-OpenData

200,660 downloads · cc-by-sa-4.0

Boasting over 13,000 hours of cumulative data and 5 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. Update Notes:Stage 3 data upload completed. 13,00

  • Video

xperience-10m

188,007 downloads · other

⚠️ Important: If you have already submitted an access request but have not completed the required DocuSign agreement, your request will remain pending. Please complete signing and we will grant access

  • 3d
  • Audio

language_table_lerobot

165,979 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605,

bridge_orig_lerobot

151,758 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,

  • Video

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

126,607 downloads · cc-by-4.0

PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robo

fractal20220817_data_lerobot

96,222 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 5

  • Video

kumagong

78,972 downloads · cc-by-nc-sa-4.0

KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task

2025-challenge-demos

75,097 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t

  • Video

AgiBotWorld-Beta

68,418 downloads · not specified

Key Features 🔑 1 million+ trajectories from 100 robots, with a total duration of 2976.4 hours. 100+ real-world scenarios across 5 target domains. Cutting-edge hardware: visual tactile sensors / 6-DoF

  • Text
  • Webdataset

OmniAction

63,389 downloads · cc-by-nc-4.0

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language

bridgev2

60,561 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,

bridge_orig_lerobot

54,264 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,

roboverse_data

50,345 downloads · apache-2.0

This dataset is part of the RoboVerse project, as described in the paper RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning.

OmniAction

47,824 downloads · cc-by-nc-4.0

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language

XVLA-Soft-Fold

47,595 downloads · mit

Cloth-Folding Dataset for X-VLA Paper This dataset contains 1,500 episodes of cloth folding, collected using Agilex's robotic arm. It was used in the X-VLA paper for cloth-folding tasks, showcasing a

AgiBotWorld2026

47,447 downloads · cc-by-nc-sa-4.0

AgiBot World 2026 Real-World Embodied Intelligence Dataset Overview As robotics research advances into real-world scenarios, the demand for authentic, high-quality data has become increasingly urgent.

  • Image
  • Text

InternData-N1

47,044 downloads · cc-by-sa-4.0

InternData-N1 InternData-N1 is a large-scale, unified vision-language navigation dataset consolidating multiple benchmarks into a standardized format. With over 240,000 trajectories across 3,000+ scen

kuka_lerobot

46,810 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "kuka_iiwa", "total_episodes": 209880, "total_frames": 2455879, "total_tasks": 1,

  • Video

ManiTwin-100K

44,824 downloads · cc-by-nc-4.0

ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research

libero

40,682 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10. All datasets were taken fro

  • Image
  • Timeseries
  • Parquet

PhysicalAI-Robotics-Open-H-Embodiment

37,477 downloads · cc-by-4.0

Dataset Description: Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate AI autonomy models for surgical robotics and ultrasoun

  • Image
  • Video

computer-use-large

37,008 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

InternData-A1

36,451 downloads · not specified

InternData-A1 InternData-A1 is a hybrid synthetic-real manipulation dataset containing over 630k trajectories and 7,433 hours across 4 embodiments, 18 skills, 70 tasks, and 227 scenes, covering rigid,

  • 3d
  • Image

OmniAction

33,980 downloads · cc-by-nc-4.0

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language

PhysicalAI-Robotics-Manipulation-Kitchen-Demos

30,417 downloads · cc-by-4.0

PhysicalAI-Robotics-Manipulation-Kitchen-Demos We provide a 600 hours of human-teleoperated demonstrations across 316 different tasks, totalling 55k trajectories. The datasets are collected using Fran

maniparena-dataset

30,062 downloads · apache-2.0

ManipArena Dataset Training dataset for ManipArena, a real-robot benchmark and competition for bimanual manipulation at the CVPR 2026 Embodied AI Workshop. This dataset provides rich multi-modal demon

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

28,902 downloads · cc-by-4.0

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a

computer-use-large

28,380 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

DECO-50

27,922 downloads · apache-2.0

DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter DECO-50 is a bimanual dexterous manipulation dataset with tactile sensing, comprising

  • Image
  • Text
  • Webdataset

aloha_sim_transfer_cube_human

26,167 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

InternData-A1-LeRobot-v3.0-by-embodiment

25,327 downloads · cc-by-nc-sa-4.0

InternData-A1 dataset taken from InternRobotics/InternData-A1, with the tarballs extracted and directory structure "transposed" so that the top-level subdirectories are the four embodiments. Two frank

  • Tabular
  • Timeseries

ViFailback-Dataset

25,323 downloads · mit

ViFailback Dataset: Real-World Robotic Manipulation Failure Dataset with Visual Symbol Guidance A real-world dataset for diagnosing, correcting, and learning from robotic manipulation failures via vis

  • Image
  • Imagefolder

humanoid-everyday

25,031 downloads · apache-2.0

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti

  • Video

libero

24,673 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk

  • Image
  • Timeseries
  • Parquet

pusht

23,852 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks":1, "total_v

  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

22,090 downloads · cc-by-4.0

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a

bridgev2

21,791 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,

  • Video

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

21,084 downloads · cc-by-4.0

PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a

high_quality_folding

20,763 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

OpenX-Embodiment

19,757 downloads · cc-by-4.0

Open X-Embodiment Dataset (unofficial) This is an unofficial Dataset Repo. This Repo is set up to make Open X-Embodiment Dataset (55 in 1) more accessible for people who love huggingface🤗. Open X-Emb

level12_rac_2_2026-02-07_and_mir

18,546 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 10770, "total_frames": 26748966, "total_tas

  • Tabular
  • Timeseries
  • Parquet

RoboMIND

18,187 downloads · apache-2.0

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation Accepted by Robotics: Science and Systems (RSS) 2025. 📢 ANNOUNCEMENT | RoboMIND V2.0 Release We're excited t

droid_success

16,949 downloads · cc-by-4.0

DROID Success Episodes Successful episodes in DROID-COMMUNITY. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~89% of all successful episodes. Ported from raw

InternScenes

16,673 downloads · not specified

InternScenes InternScenes is a large-scale interactive indoor scene dataset with realistic layouts. This dataset comprises approximately 40,000 diverse scenes and 1.96M 3D objects that cover 15 common

  • 3d
  • Image
  • Webdataset

aloha_sim_insertion_human

16,413 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1

15,868 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,

  • Tabular
  • Text
  • Parquet

ArtVIP

15,632 downloads · apache-2.0

ArtVIP dataset card 🎉🎉🎉 ArtVIP is accepted by ICLR 2026 Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Mo

full_folding

15,605 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5688, "total_frames": 14129038, "total_task

  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot

15,331 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,

  • Video

3d_optical_flow_droid

14,567 downloads · mit

3D Optical Flow DROID Dataset Processed DROID robotics dataset with optical flow and scene flow annotations. Dataset Structure Organized by lab, each trajectory in separate tar.gz archive: IPRL/IPRL+2

  • Image
  • Text
  • Webdataset

computer-use-large

14,562 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

aloha_sim_insertion_scripted

14,048 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

FreeTacMan

13,952 downloads · mit

📦 FreeTacman Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation [ICRA 2026] 🎯 Overview This dataset supports the paper FreeTacman: Robot-free Visuo-Tactile Data Collection

  • Video

level12_rac_2_2026-02-07

13,951 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5551, "total_frames": 13805156, "total_task

  • Tabular
  • Timeseries
  • Parquet

pusht_image

13,329 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks": 1, "total_

  • Image
  • Timeseries
  • Parquet

ManiTwin-100K

12,690 downloads · apache-2.0

ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research

RoboChallenge-lerobot-merged

12,422 downloads · apache-2.0

RoboChallenge-lerobot-merged Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset (

  • Tabular
  • Text
  • Parquet

xarm_lift_medium

12,281 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_scripted

12,120 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

Kai0

12,110 downloads · cc-by-nc-sa-4.0

KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task

  • Tabular
  • Timeseries
  • Parquet

AgiBotWorld-Alpha

12,097 downloads · not specified

⚠️Important Notice !!! Dear Users, The Alpha Dataset has been updated as follows: Frame Loss Data Removal: Several episodes with frame loss issues have been removed. For the complete list of removed e

  • Text
  • Webdataset

humanoid-everyday

10,991 downloads · apache-2.0

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti

  • Video

AgiBot-g1_picks_up_parts_b

10,455 downloads · apache-2.0

AgiBot-g1_picks_up_parts_b 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Robot Type: ruantong_a2d | Codebase Version: v2.1 End-Effector Type:

  • Tabular
  • Timeseries
  • Parquet

GNN_Disassembly_WorldModel

10,423 downloads · cc-by-4.0

GNN Constraint-Aware World Model Dataset (v3) Real robot episodes with per-frame constraint graphs, SAM2 segmentation masks + 256-D feature embeddings, full 3D depth bundles, and synchronized robot st

RealSource-World

10,280 downloads · cc-by-nc-sa-4.0

RealSource World RealSource World is a large-scale real-world robotics manipulation dataset collected using RS-02 dual-arm humanoid robot. This dataset contains diverse long-horizon manipulation tasks

level12_rac_2_2026-02-05_ag

9,997 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5413, "total_frames": 13497374, "total_task

  • Tabular
  • Timeseries
  • Parquet

aloha_sim_insertion_human_image

9,832 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_vid

  • Image
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_human_image

9,767 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Image
  • Timeseries
  • Parquet

cache

9,744 downloads · bsd-3-clause

ThinkJEPA Proprocessed Data Cache Dataset Description This repository contains the released preprocessed cache used by ThinkJEPA: ThinkJEPA: Empowering Latent World Models with Large Vision-Language R

  • Text
  • Video
  • Text

level12_rac_2_2026-02-04

9,734 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5357, "total_frames": 13251592, "total_task

  • Tabular
  • Timeseries
  • Parquet

aloha_sim_transfer_cube_scripted_image

9,650 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Image
  • Timeseries
  • Parquet

grand_tour_dataset

9,615 downloads · mit

The GrandTour Dataset A project brought to you by RSL - ETH Zurich. References • Contributing • Citation References Official dataset webpage: grand-tour.leggedrobotics.com Getting started & examples:

aloha_sim_insertion_scripted_image

9,610 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Image
  • Timeseries
  • Parquet

RoboCerebra_lerobot_v2.1

9,034 downloads · apache-2.0

RoboCerebra (LeRobot v2.1 Format) Dataset Description This dataset is a converted version of RoboCerebra, now formatted for compatibility with the LeRobot ecosystem (v2.1). While the official RoboCere

  • Video

metaworld_mt50

8,738 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c

  • Image
  • Timeseries
  • Parquet

L2D

8,213 downloads · apache-2.0

TL;DR of L2D, the world's largest self-driving dataset! Read more about L2D on the official Huggingface blog: LeRobot goes to driving school 90+ TeraBytes of multimodal data (5000+ hours of driving) f

  • Tabular
  • Text
  • Parquet

Sim1_Dataset

8,119 downloads · apache-2.0

Sim1_Dataset LeRobot-format manipulation dataset for cloth folding / deformable object tasks. Format This repository follows a LeRobot-style layout per subset: data/chunk-xxx/episode_XXXXXX.parquet: f

  • Tabular
  • Timeseries

LARYBench

8,102 downloads · cc-by-nc-sa-4.0

LARY — A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment LARY is a unified evaluation framework for latent action representations. Given any model that pro

agent-reward-bench

7,568 downloads · not specified

AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Mead

  • Image
  • Text
  • Csv

Bridge-CoT

7,252 downloads · cc-by-4.0

Bridge-CoT Robot manipulation dataset with chain-of-thought annotations, derived from BridgeDataV2. Each sample pairs a scene image with a task description and structured VLM-generated annotations inc

Agilex_Cobot_Magic_place_towel_flat

7,053 downloads · apache-2.0

Agilex_Cobot_Magic_place_towel_flat Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Epis

DOM

6,931 downloads · other

Dynamic Object Manipulation (DOM) Project Page | Paper | Code TL;DR: DOM is a large-scale dynamic manipulation dataset with 200K episodes, 2,800+ scenes, and 206 objects for training and evaluating VL

droid_success_high_quality

6,916 downloads · cc-by-4.0

DROID Success (High-Quality Extrinsics) Subset of ~17k successful episodes in DROID-COMMUNITY filtered for high-quality camera extrinsics. Ported from raw 1.0.1 data at full resolution to LeRobotDatas

thesis-reconstructed-trajectories

6,868 downloads · mit

Reconstructed GNM Trajectories GNM-format trajectories reconstructed from RECON, SCAND, GoStanford2, and other navigation datasets. Used as training data for the CAST safety pipeline and Neural CBF. S

  • Image

MolmoAct-Dataset

6,845 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Description This dataset contains MolmoAct Dataset in lerobot format. All contents in this dataset were collected in-house by Ai2. Quick links: 📂 All M

  • Image
  • Timeseries
  • Parquet

SandThink

6,566 downloads · mit

SandThink Dataset (v1.0) SandThink 是一个专为具身智能 (Embodied AI) 任务设计的大规模指令微调与偏好对齐数据集。该数据集通过结构化的 Chain-of-Thought (CoT) 推理过程,显著提升了 Vision-Language-Action (VLA) 模型在复杂环境下的任务拆解、路径规划和动作执行能力。 📊 数据集概览 (Dataset S

  • Image
  • Imagefolder

droid_v3

6,517 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,

  • Tabular
  • Text
  • Parquet

fmb_dataset_lerobot

6,440 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 8612, "total_frames": 1137459, "total_tasks": 24, "tot

  • Tabular
  • Timeseries
  • Parquet

comet-1.5k

6,384 downloads · mit

Comet-1.5k — RFT Dataset (Openpi-Comet) This repository releases Comet-1.5k, an RFT (Rejection Sampling Fine-Tuning) dataset generated by Team Comet’s post-training pipeline on the BEHAVIOR-1K simulat

RoboChallenge-lerobot

6,361 downloads · apache-2.0

RoboChallenge-lerobot Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset in LeRob

  • Tabular
  • Timeseries
  • Parquet

Agilex_Cobot_Magic_heat_burger

6,311 downloads · apache-2.0

Agilex_Cobot_Magic_heat_burger Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Episodes:

Cobot_Magic_make_hamburger

6,044 downloads · apache-2.0

Cobot_Magic_make_hamburger 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Robot Type: agilex_cobot_decoupled_magic | Codebase Version: v2.1 End

  • Tabular
  • Timeseries
  • Parquet

Edge-Agent-Reasoning-WebSearch-260K

5,993 downloads · mit

Edge Agent Reasoning WebSearch 260K Abstract The Edge-Agent-Reasoning-WebSearch-260K dataset is a massive, synthetically expert-engineered corpus of over 700 Million tokens, designed to train small, l

  • 3d
  • Audio
  • Parquet

GR1-Tabletop-NextState-1000x24

5,976 downloads · apache-2.0

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo

  • Tabular
  • Video
  • Parquet

MolmoAct-Pretraining-Mixture

5,929 downloads · cc-by-4.0

MolmoAct - Pretraining Mixture Data Mixture used for MolmoAct Pretraining. Contains a subset of OXE formulated as Action Reasoning Data along with auxiliary robot data and link to Multimodal Web data.

  • Image
  • Text
  • Parquet

Agilex_Cobot_Magic_storage_towel

5,830 downloads · apache-2.0

Agilex_Cobot_Magic_storage_towel Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Episode

wanderland

5,814 downloads · apache-2.0

Wanderland Dataset Dataset Description Wanderland is a large-scale urban dataset designed for geometrically grounded simulation and open-world embodied AI research. The dataset contains diverse urban

  • 3d
  • Text
  • Text

PDD

5,746 downloads · cc-by-nc-4.0

PDD: Personalized Driving Dataset Dataset Description PDD (Personalized Driving Dataset) is a multi-driver, multi-scenario driving dataset collected in CARLA 0.9.15. It captures real human driving beh

  • Image

PhysicalAI-Robotics-Manipulation-SingleArm

5,705 downloads · cc-by-4.0

Dataset Description: PhysicalAI-Robotics-Manipulation-SingeArm is a collection of datasets of automatic generated motions of a Franka Panda robot performing operations such as block stacking, opening

RoboInter-Data

5,682 downloads · not specified

RoboInter-Data: Intermediate Representation Annotations for Robot Manipulation Rich, dense, per-frame intermediate representation annotations for robot manipulation, built on top of DROID and RH20T. D

  • Text
  • Video
  • JSON

PH2D

5,532 downloads · mit

Dataset Description This dataset contains egocentric human-humanoid data that can be used to co-train manipulation policy for humanoid robot. For paper, visualization, and example code using this data

r3d

5,403 downloads · not specified

R3D: Revisiting 3D Policy Learning Project Page | Paper | GitHub This repository contains the pre-processed data for R3D, a framework for robust and scalable 3D imitation learning. R3D identifies the

SAGE-3D_InteriorGS_usdz

5,400 downloads · cc-by-nc-4.0

SAGE-3D InteriorGS USDZ: USDZ-Format 3D Gaussian Scenes for Isaac Sim Paper | Project Page | Code InteriorGS dataset converted to USDZ format for seamless integration with NVIDIA Omniverse and Isaac S

  • 3d
  • Image
  • Imagefolder

example_data_fastumi_pro_raw

5,360 downloads · other

FastUMI Pro – Multimodal Sample Dataset Small-Scale Demonstration Data from the FastUMI Pro Multimodal Sensing System (Only Hundreds of Trajectories — Full Dataset Available Upon Request) Project Home

wm-eval-samples-ewmbench

5,057 downloads · cc-by-nc-sa-4.0

WM-Eval Samples: EWMBench Generated Videos Pre-generated video samples for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Use this dataset as vi

  • Image
  • Imagefolder

CostNav-Teleop-Dataset

5,037 downloads · apache-2.0

CostNav Teleop Dataset Dataset Summary The CostNav Teleop Dataset is a large-scale collection of human teleoperation recordings for robot navigation in an urban sidewalk simulation environment. It was

  • Tabular
  • Text
  • Csv

lead

4,952 downloads · mit

LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving Project Page | Paper | Code Official CARLA dataset accompanies our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving

RoboCerebra_hdf5

4,938 downloads · apache-2.0

📦 RoboCerebra HDF5 Dataset This repository contains the structured HDF5 dataset for the RoboCerebra project, optimized for high-performance robot learning training. ✅ Status: Data Upload Complete. Al

  • Video

PhysicalAI-SimReady-Warehouse-01

4,673 downloads · cc-by-4.0

NVIDIA Physical AI SimReady Warehouse OpenUSD Dataset Dataset Version: 1.1.0 Date: May 18, 2025 Author: NVIDIA, Corporation License: CC-BY-4.0 (Creative Commons Attribution 4.0 International) Contents

  • Image
  • Text
  • Csv

RoboBench

4,665 downloads · cc-by-4.0

RoboBench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain 📋 Overview RoboBench is a comprehensive evaluation benchmark designed to assess the capabilities

  • Image
  • Text
  • JSON

scenesmith-example-scenes

4,597 downloads · apache-2.0

SceneSmith Example Scenes Project Page | Paper | Code Example scenes generated by SceneSmith, a hierarchical agentic framework for constructing simulation-ready indoor environments from natural langua

computer-use-large

4,534 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

community_dataset_v3

4,529 downloads · apache-2.0

Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets acro

bridge-rlds

4,466 downloads · mit

Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Bridge dataset

libero

4,453 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks":40, "chunks

  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_v30

4,442 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596,

  • Tabular
  • Text
  • Parquet

bridge_v2_lerobot

4,422 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,

community_dataset_v1

4,357 downloads · apache-2.0

Community Dataset v1 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 128 datasets from 55 contributors worldwide. We used this dataset to pretrain S

  • Video

EO-Data1.5M

4,302 downloads · apache-2.0

🤖 EO-Data-1.5M A Large-Scale Interleaved Vision-Text-Action Dataset for Embodied AI The first large-scale interleaved embodied dataset emphasizing temporal dynamics and causal dependencies among visi

  • Image
  • Text
  • Parquet

carla-dataset-ped

4,300 downloads · cc-by-4.0

CARLA Stage 2 Pedestrian Dataset A large-scale driving dataset (Stage 2) focused on pedestrians, captured from CARLA simulator. This dataset contains RGB images and depth maps with camera parameters,

so101_pick_place

4,286 downloads · mit

Pick object and place in box Project KIWI | Qian Group HRI Lab | University of Houston Detail Value Task Pick object and place in box Episodes 50 FPS 30 Cameras Gripper (Arducam) + Overhead (Intel Rea

  • Image
  • Imagefolder

Retargeted_AMASS_for_robotics

4,151 downloads · cc-by-4.0

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a

xarm_push_medium_replay

4,122 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot_pathmask

3,958 downloads · apache-2.0

PEEK VLM-Labeled BRIDGE_v2 dataset This dataset contains the LeRobot-format BRIDGE-v2 dataset with paths and masks from the PEEK VLM drawn onto the image: PEEK: Guiding and Minimal Image Representatio

  • Video

Retargeted_AMASS_for_bxi_elf2

3,906 downloads · cc-by-4.0

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a

Transparent_BOP

3,902 downloads · cc-by-4.0

Object Pose Estimation Using Implicit Representation for Transparent Objects This dataset aggregates high quality 3D mesh assets and rendered data for training and fine-tuning pose estimation models.

dynamics-probing

3,873 downloads · mit

PhysProbe Dynamics Probing Dataset Manipulation episodes from Isaac Lab collected for probing physics understanding in video world models (V-JEPA 2, VideoMAE, DINOv2). Each episode includes dual-camer

  • Video

droid_failure

3,832 downloads · cc-by-4.0

DROID Failure Episodes Failure episodes in DROID-COMMUNITY. Episodes with inconsistent data were discarded, leaving ~91% of all failure episodes. Ported from raw 1.0.1 data at full resolution to LeRob

xarm_push_medium

3,669 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

libero_90_no_noops_lerobot

3,630 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 3921, "total_frames": 569249, "total_tasks": 73, "tota

  • Tabular
  • Timeseries
  • Parquet

xarm_lift_medium_replay

3,608 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

AgiBotWorld-Alpha-lerobot

3,607 downloads · apache-2.0

Note This is an unofficial implementation. This dataset was converted from AgiBotWorld-Alpha into the LeRobot format using https://github.com/Tavish9/any4lerobot/tree/main/agibot2lerobot Configuration

  • Tabular
  • Timeseries
  • Parquet

OmniAction-LIBERO

3,565 downloads · cc-by-nc-4.0

RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language

computer-use-large

3,547 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

bc_z_lerobot

3,535 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 1

  • Video

UrbanVerse-100K

3,530 downloads · odc-by

UrbanVerse-100K Dataset [!NOTE] UrbanVerse-100K is a large-scale, physics-aware 3D asset and material database curated for urban simulation, physical and embodied AI research. It contains over 102K me

  • 3d

computer-use-large

3,496 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

computer-use-large

3,488 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

Bench2Drive-Speed

3,478 downloads · cc-by-nc-nd-4.0

Bench2Drive-Speed Project Page | Paper | GitHub Bench2Drive-Speed is a closed-loop benchmark for desired-speed conditioned autonomous driving, enabling explicit control over vehicle behavior through t

seed

3,468 downloads · other

BONES-SEED: Skeletal Everyday Embodiment Dataset BONES-SEED (Skeletal Everyday Embodiment Dataset) is an open dataset of 142,220 annotated human motion animations for humanoid robotics. It provides mo

aloha_static_cups_open

3,443 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

computer-use-large

3,435 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

computer-use-large

3,389 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

computer-use-xl

3,362 downloads · cc-by-4.0

Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc

level2_final_quality2_augmented

3,360 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 2366, "total_frames": 6205242, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

AMASS_Retargeted_for_G1

3,339 downloads · cc-by-4.0

Retargeted AMASS Dataset for G1 This repository contains 100% of the AMASS dataset retargeted to the G1 humanoid and formatted for use with IsaacLab's AMP motion loader. It should also be compatible w

fractal20220817_data_lerobot

3,307 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 5

FuSe

3,274 downloads · mit

The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments,

wm-eval-gt-ewmbench

3,268 downloads · cc-by-nc-sa-4.0

WM-Eval GT: EWMBench Ground Truth Ground truth data for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Source Extracted from agibot-world/EWMBen

  • Image
  • Imagefolder

aloha_mobile_cabinet

3,230 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 85, "total_frames": 127500, "total_tasks": 1, "chunks_s

  • Tabular
  • Timeseries
  • Parquet

community_dataset_v2

3,216 downloads · apache-2.0

Community Dataset v2 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 340 datasets from 117 contributors worldwide. This dataset represents the secon

  • Video

DR_dataset

3,206 downloads · mit

DR_dataset Simulated robot manipulation dataset for imitation learning. Dataset Description This dataset contains 200 episodes of a Trossen WXAI robotic arm performing food transfer tasks in MuJoCo si

oxford_spires_dataset

3,196 downloads · cc-by-nc-sa-4.0

We present the Oxford Spires Dataset, captured in and around well-known landmarks in Oxford using a custom-built multi-sensor perception unit as well as a millimetre-accurate map from a terrestrial Li

lerobot_derek_depth

3,160 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "chunks_s

  • Image
  • Timeseries
  • Parquet

SimScale

3,007 downloads · apache-2.0

Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Ti

LeWAM_community_dataset

3,000 downloads · apache-2.0

LeWAM Community Dataset A large-scale community-contributed robotics dataset for robot learning research featuring 321 datasets from the LeRobot community. Derived from Community Dataset v2, and conve

  • Video

agibot_alpha_v30

2,999 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AgiBot_A2D", "total_episodes": 28122, "total_frames": 47613574, "total_tasks": 30

  • Tabular
  • Text
  • Parquet

0402data

2,955 downloads · apache-2.0

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 82092, "to

  • Tabular
  • Timeseries
  • Parquet

aloha_pen_uncap

2,951 downloads · apache-2.0

Dataset Card for aloha_pen_uncap This dataset is a FiftyOne conversion in LeRobot format of the aloha_pen_uncap_diverse subset of BiPlay. The aloha_pen_uncap_diverse subset is a task-specific segment

  • Image

berkeley-frodobots-lerobot-7k

2,937 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "frodobot", "total_episodes": 185761, "total_frames": 163360777, "total_tasks": 1,

  • Tabular
  • Timeseries
  • Parquet

robomme

2,887 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1600, "total_frames": 768897, "total_tasks": 116, "chun

  • Tabular
  • Text
  • Parquet

DOMINO

2,849 downloads · apache-2.0

Towards Generalizable Robotic Manipulation in Dynamic Environments Heng Fang1, Shangru Li1, Shuhan Wang1, Xuanyang Xi2, Dingkang Liang1, Xiang Bai1 1 Huazhong University of Science and Technology, 2 H

MarketGen

2,838 downloads · cc-by-nc-sa-4.0

MarketGen : A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments 📑 arXiv | 🌐 Project | 🤗 Hugging Face MarketGen is a scalable simulation platform with automatic scen

  • Image

L2D-v3

2,811 downloads · apache-2.0

This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.

  • Video

agent-reward-bench

2,747 downloads · not specified

AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Mead

  • Image
  • Text
  • Csv

furniture_bench_dataset_lerobot

2,692 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "tota

  • Tabular
  • Timeseries

xarm_push_medium_image

2,661 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Image
  • Timeseries
  • Parquet

vlabench_primitive_ft_lerobot_video

2,655 downloads · mit

VLABench Primitive Tasks Dataset - LeRobot v3.0 Dataset Description This dataset is organized in the LeRobot v3.0 format and is used for integrating VLABench into the LeRobot framework officially. Com

  • Tabular
  • Timeseries
  • Parquet

xarm_lift_medium_replay_image

2,650 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Image
  • Timeseries
  • Parquet

ManiSkill_Demonstrations

2,644 downloads · apache-2.0

ManiSkill Demonstrations This dataset repo contains all of the latest ManiSkill demonstration datasets as well as some pretained model weights used to generate some demonstrations. To download by envi

xarm_lift_medium_image

2,643 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Image
  • Timeseries
  • Parquet

xarm_push_medium_replay_image

2,626 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_

  • Image
  • Timeseries
  • Parquet

droid_100

2,575 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 32212, "total_tasks": 47, "total

  • Tabular
  • Timeseries
  • Parquet

Task_0002_OrderPicking_lerobot

2,532 downloads · apache-2.0

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "total_episodes": 857, "total_frames": 85474, "total_videos": 2828, "codebase_version": "v2.1", "robot

  • Tabular
  • Timeseries
  • Parquet

Bright2026_19229

2,515 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19229 (MVP) Competition task: 抓水果 (Fruit Grasping) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-d

SAGE-3D_Collision_Mesh

2,494 downloads · cc-by-nc-4.0

SAGE-3D Collision Mesh: Physics-Enabled Collision Bodies for 3D Gaussian Scenes Paper | Project Page | Code High-precision collision geometry dataset extracted from 1,000 indoor Mesh scenes, enabling

  • Image
  • Imagefolder

SNOW.SB

2,452 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 91, "total_frames": 73578, "total_tasks": 1, "c

  • Tabular
  • Timeseries
  • Parquet

libero-rlds

2,434 downloads · mit

Dataset Structure These datasets are used for MemoryVLA training. This is the standard LIBERO setting and can be directly used for other models as well.All data follow the RLDS format from the LIBERO

stationaray_sdk_edit_test_a

2,432 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Converted from TrossenMCAP format using the Trossen SDK trossen_mcap_to_lerobot_v2 tool. Dataset Structure meta/info.json: { "codebase_versi

  • Image
  • Tabular
  • Parquet

agibotworld-alpha

2,423 downloads · cc-by-sa-4.0

from datasets import Features, Value, Video, load_dataset features = Features({ "head_color.mp4": Video(), "proprio_stats.hdf5": Value("binary"), "task.json": { "episode_id": Value("int64"), "init_sce

taco_play_lerobot

2,377 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "tot

  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Brainco_Collect_Plates_Into_Dishwasher

2,347 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

dobbe_lerobot

2,275 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 5208, "total_frames": 1139911, "total_tasks": 7

  • Tabular
  • Timeseries
  • Parquet

D2E-480p

2,236 downloads · cc-by-nc-4.0

D2E-480p Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI. 268.7 hours of sy

  • Video

fractal-rlds

2,232 downloads · mit

Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Fractal datase

D2E-Original

2,230 downloads · cc-by-nc-4.0

D2E-Original Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI. 273.4 hours o

  • Video

SocialNav-SUB

2,197 downloads · mit

SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation This is the accompying dataset for the Social Navigation Scene Understanding Benchmark (SocialNav-SUB) which is a Vi

  • Image
  • Text
  • Csv

Retargeted_AMASS_for_FourierN1

2,174 downloads · cc-by-4.0

Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a

berkeley_cable_routing

2,147 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1647, "total_frames": 42328, "total_tasks": 1, "chunk

  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_lerobot

2,118 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5", "total_episodes": 896, "total_frames": 87783, "total_tasks": 5, "total_vide

  • Tabular
  • Timeseries
  • Parquet

scannet-dataset

2,091 downloads · apache-2.0

This is ScanNet dataset containing indoor scenes which is used for 3d object detection, 3d segmentation, etc. Acknowledgement @inproceedings{dai2017scannet, title={ScanNet: Richly-annotated 3D Reconst

RoboSense

2,075 downloads · cc-by-sa-4.0

Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments Description RoboSense is a large-scale multimodal dataset constructed to facil

G1_Dex3_ToastedBread_Dataset

2,059 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

Master_wire_hook_20260402

2,046 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 638, "total_frames": 421684, "tota

  • Tabular
  • Timeseries
  • Parquet

Amelia42-Mini

2,011 downloads · bsd-2-clause

Dataset Overview The Amelia42-Mini dataset provides air traffic position reports for 42 major U.S. airports, including the following airports: KATL (Hartsfield-Jackson Atlanta International Airport) K

  • Tabular
  • Text
  • Csv

G1_WBT_Inspire_Put_Clothes_into_Washing_Machine

1,999 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

ARCH_master_wire_hook_20260402

1,998 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 703, "total_frames": 456261, "tota

  • Tabular
  • Timeseries
  • Parquet

dagger_final_1_21

1,993 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1, "total_frames": 841, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

so-combined-eng

1,947 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description The English version of this dataset integrates 598 open-source community datasets into a single unified corpus, comprising 22,709 episodes a

  • Tabular
  • Timeseries

ATG-MoE_TrainingSet

1,913 downloads · mit

ATG-MoE Pressure-Reducing Valve Assembly Training Set This repository provides the training set used in our paper: "ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly s

COMMAND

1,903 downloads · cc-by-nc-4.0

COMMAND Dataset 😃 🤗 Dataset Availability The evaluation data from this dataset is now available on Hugging Face: Repository: maum-ai/COMMAND Content: 60 evaluation scenarios (240 files, ~38GB) File

LIBERO-Para

1,870 downloads · mit

LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models LIBERO-Para is a controlled benchmark for evaluating the paraphrase robustness of Vision-Language-Action (VLA) m

MolmoAct-Midtraining-Mixture

1,858 downloads · cc-by-4.0

MolmoAct - Midtraining Mixture Data Mixture used for MolmoAct Midtraining. Contains MolmoAct Dataset formulated as Action Reasoning Data. MolmoAct is a fully open-source action reasoning model for rob

  • Image
  • Text
  • Parquet

G1_WBT_Brainco_Pickup_Pillow

1,855 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

simtos_one_item_can_pet_trim_0.2

1,844 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_

  • Tabular
  • Timeseries
  • Parquet

MesaTask-10K

1,842 downloads · not specified

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning 🔑Key Features MesaTask-10K, a large-scale dataset for task-oriented tabletop scene generation, comprises approximately

SandGO

1,825 downloads · apache-2.0

SandGO Dataset (v1.0) SandGO 是一个专为具身智能 (Embodied AI) 和机器人学习设计的 long-horizon 多模态数据集。该数据集由原始 MarsMind_data 经过精炼、去冗余和结构化处理后得到,旨在支持 长序列决策、指令跟随 和 多模态模仿学习 等任务。数据集包含完整的状态‑动作序列、多视角 RGB 图像、LiDAR 点云、深度点云 以及对应的

  • Image

SuSuInterActs

1,791 downloads · cc-by-4.0

SuSuInterActs Dataset A Large-Scale Multimodal Dialogue Corpus with Synchronized Speech, Full-Body Motion, and Facial Expressions From the paper: SentiAvatar: Towards Expressive and Interactive Digita

gs_scenes

1,779 downloads · apache-2.0

A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Ziyuan Xia • Jingyi Xu • Chong Cui • Yuanhong Yu• Jiazhao Zhang • Qingsong Yan • Tao Ni Junbo Chen • Xiaowei Zhou • Hujun Bao • Rui

Bright2026_19382

1,773 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19382 (别碰我爪子) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56

  • Video

MetaFold

1,770 downloads · cc-by-nc-nd-4.0

Dataset Card for Dataset Name MetaFold Dataset is a point-cloud trajectory dataset designed for multi-category garment folding tasks in robotic manipulation. Dataset Details Dataset Description Curate

svla_so101_pickplace

1,761 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100_follower", "total_episodes": 50, "total_frames": 11939, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

level2_final_quality2

1,753 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1183, "total_frames": 3102621, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

roboturk_lerobot

1,747 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "sawyer", "total_episodes": 1796, "total_frames": 168423, "total_tasks": 3, "total

  • Tabular
  • Timeseries
  • Parquet

L2D-v3

1,741 downloads · apache-2.0

This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.

  • Tabular
  • Text
  • Parquet

dk1_2026-03-01

1,653 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 141, "total_frames": 306425, "total_tasks": 1

  • Tabular
  • Timeseries
  • Parquet

LeWAM_community_dataset_preprocessed

1,652 downloads · apache-2.0

LeWAM Community Dataset (Preprocessed) A large-scale community-contributed robotics dataset for robot learning research featuring 318 datasets from the LeRobot community. Derived from Community Datase

  • Video

robocasa_depth_test

1,644 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 2, "total_frames": 309, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

level2_final_quality3_trim_0_hil_data

1,631 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

Bright2026_19380

1,628 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19380 (MAEN) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-

VLA_Arena_L0_L_lerobot_openpi

1,589 downloads · apache-2.0

VLA-Arena Dataset (L0 - Large Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and u

  • Image
  • Timeseries
  • Optimized Parquet

berkeley_autolab_ur5

1,587 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 97939, "total_tasks": 5, "total

  • Tabular
  • Timeseries
  • Parquet

Bright2026_19361

1,568 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19361 (DynamicX) Competition task: 环套在柱子上 (Put Ring onto Rod) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimen

  • Video

BiPlay

1,563 downloads · mit

BiPlay contains 9.7 hours of bimanual data collected with an aloha robot at the RAIL lab @ UC Berkeley, USA. It contains 7023 clips, 2000 language annotations and 326 unique scenes. Paper: https://hug

jaco_play

1,541 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1085, "total_frames": 77965, "total_tasks": 89, "tota

  • Tabular
  • Timeseries
  • Parquet

libero-10-lerobot

1,539 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101499, "total_tasks": 10, "total_

  • Image
  • Timeseries
  • Parquet

SimScale

1,537 downloads · apache-2.0

Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Ti

  • Image
  • Text
  • Webdataset

HuMI-Raw-Data

1,511 downloads · cc-by-4.0

Raw Data for HuMI [Project Page] | [Paper] Dataset Summary This dataset contains raw data collected using the HuMI data collection pipeline. It includes MP4 videos and tracker trajectories. The datase

utaustin_mutex_lerobot

1,489 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "tota

  • Tabular
  • Timeseries
  • Parquet

level2_final_quality3_t_0_hil_data_c

1,488 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-Manipulation-Kitchen

1,477 downloads · cc-by-4.0

PhysicalAI Robotics Manipulation in the Kitchen Dataset Description: PhysicalAI-Robotics-Manipulation-Kitchen is a dataset of automatic generated motions of robots performing operations such as openin

  • Tabular
  • Timeseries
  • Parquet

bridge_orig_lerobot

1,477 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,

  • Video

G1_Dex1_Pack_PingPong

1,476 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

usc_cloth_sim

1,471 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 100000, "total_tasks": 1, "chun

  • Tabular
  • Timeseries
  • Parquet

libero_90_lerobot

1,464 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 3958, "total_frames": 574553, "total_tasks": 89, "total

  • Image
  • Timeseries
  • Parquet

Bright2026_19234

1,451 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19234 (Apex) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-

libero_10_image

1,450 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks

  • Image
  • Timeseries
  • Parquet

0413

1,445 downloads · apache-2.0

This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 79017, "to

  • Tabular
  • Timeseries
  • Parquet

libero

1,440 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10. All datasets were taken fro

  • Image
  • Timeseries
  • Parquet

level2_final_quality3_trim

1,436 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1197, "total_frames": 3146294, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

RoboInter-VQA

1,429 downloads · not specified

RobotInter-VQA: Intermediate Representation Understanding & Generation VQA Dataset for Manipulation English | 简体中文 A Visual Question Answering dataset for robotic manipulation, developed as part of th

LIBERO-plus

1,427 downloads · mit

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website | 🤗 Assets | 🤗 Model | 📁 Training Dataset 🔥 Overview This repository contains the offici

G1_Dex1_Fold_Towel

1,419 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

berkeley_rpt

1,417 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total

  • Tabular
  • Timeseries
  • Parquet

taco_play

1,415 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3603, "total_frames": 237798, "total_tasks": 406, "to

  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Clothes_Into_Basket

1,395 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

lerobot_so101_base_sim_pickplace

1,385 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 5, "total_frames": 1153, "total_tasks": 1, "total_video

  • Tabular
  • Timeseries
  • Parquet

GR1-Tabletop-NextState-300x24

1,385 downloads · apache-2.0

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo

  • Tabular
  • Video
  • Parquet

dk1_2026-03-03

1,346 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 132, "total_frames": 273043, "total_tasks": 1

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Stack_Block

1,343 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

jaco_play_lerobot

1,335 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "jaco_2", "total_episodes": 976, "total_frames": 70127, "total_tasks": 88, "total_

  • Tabular
  • Timeseries
  • Parquet

calvin_task_D_D_scale_100_lerobo_format_addmask_v3

1,329 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 5124, "total_frames": 308918, "total_tasks": 389, "tota

  • Image
  • Timeseries
  • Parquet

svla_so100_pickplace

1,321 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 50, "total_frames": 19631, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Pack_PencilBox

1,316 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

austin_sirius_dataset

1,301 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total

  • Tabular
  • Timeseries
  • Parquet

finish_sandwich

1,299 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 80, "total_frames": 70277, "total_tasks": 2, "

  • Tabular
  • Timeseries
  • Parquet

libero_spatial_image

1,293 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "chunks_

  • Image
  • Timeseries
  • Parquet

AAWR-DROID

1,290 downloads · mit

Active Perception AAWR Dataset in GRASP Lab Mock Kitchen Paper | Project page | Code Here is our full dataset (13.5GB) in DROID format: https://huggingface.co/datasets/Everloom/AAWR-DROID/ It includes

  • Image

SAGE-3D_VLN_Data

1,281 downloads · cc-by-4.0

SAGE-3D VLN Data: Vision-Language Navigation Dataset with Hierarchical Instructions Paper | Project Page | Code A comprehensive VLN dataset featuring 2 million trajectory-instruction pairs across 1,00

aera_semi_pnp_dr_02_04_2026

1,281 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 1345076, "total_tasks": 72, "ch

  • Image
  • Text
  • Parquet

G1_Dex1_Erase_Board

1,277 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

BAPData

1,275 downloads · cc-by-sa-4.0

BAPData (EFM-10 Benchmark) Project Website | Paper BAPData is a dataset introduced in the paper "Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark

  • Image

FuSe

1,273 downloads · mit

The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments,

toto_lerobot

1,263 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 902, "total_frames": 294139, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

koch_test

1,255 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "koch", "total_episodes": 51, "total_frames": 16602, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

bridge_v2_lerobot

1,247 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Converted BridgeV2 dataset to lerobot format. Downloaded from [https://rail.eecs.berkeley.edu/datasets/bridge_release/data/] (only demos_8_1

  • Tabular
  • Text
  • Parquet

libero_object_image

1,236 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 454, "total_frames": 66984, "total_tasks": 10, "chunks_

  • Image
  • Timeseries
  • Parquet

so-combined-ru

1,227 downloads · apache-2.0

Датасет создан при помощи библиотеки LeRobot. Описание датасета Русскоязычная версия данного датасета объединяет 598 открытых датасетов сообщества в единый унифицированный корпус, включающий 22 709 эп

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Clean_Table

1,220 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

stanford_kuka_multimodal_dataset

1,219 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3000, "total_frames": 149985, "total_tasks": 1, "tota

  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_shrimp

1,211 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 18, "total_frames": 67500, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

simtos_one_item_can_pet_trim_0.2

1,202 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_

  • Tabular
  • Timeseries
  • Parquet

iamlab_cmu_pickup_insert_lerobot

1,192 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 631, "total_frames": 146241, "total_tasks": 7, "total_

  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_trajectory_overlay

1,179 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 4444, "total_frames": 930500, "total_tasks": 1424, "tot

  • Image
  • Timeseries
  • Parquet

peract_lerobot_train

1,179 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1706, "total_frames": 71623, "total_tasks": 201, "total

  • Image
  • Timeseries
  • Parquet

musclemimic-retargeted

1,174 downloads · other

MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoFullBody musculoskeletal model, generated using General Motion Retargeting (GMR). Dataset Groups Group Motions Descript

TraceSpatial-Bench

1,159 downloads · apache-2.0

TraceSpatial-Bench: An Object-Centric 3D Trajectory Planning Benchmark Welcome to TraceSpatial-Bench, an object-centric 3D spatial trace planning benchmark provided by RoboTracer.TraceSpatial-Bench is

  • Image
  • Text
  • Optimized Parquet

SafeFlowMPC

1,156 downloads · mit

SafeFlowMPC Finetuning Dataset This repository contains the finetuning dataset used in the paper SafeFlowMPC: Predictive and Safe Trajectory Planning for Robot Manipulators with Learning-based Policie

  • Optimized Parquet

G1_WBT_Inspire_Collect_Clothes_MainCamOnly

1,143 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

TAMEn

1,138 downloads · cc-by-nc-sa-4.0

TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks 🎯 Overview TAMEn builds upon the UMI paradigm with key enhancements in multimodality, precision-portabil

stanford_robocook

1,131 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 2460, "total_frames": 112980, "total_tasks": 9, "tota

  • Tabular
  • Timeseries
  • Parquet

GR1-Tabletop-NextState-100x24

1,123 downloads · apache-2.0

GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo

  • Tabular
  • Video
  • Parquet

VITRA-TeleData

1,120 downloads · mit

VITRA Teleoperation Dataset Dataset Summary This dataset contains real-world robot teleoperation demonstrations collected using a 7-DoF robotic arm equipped with a dexterous hand and a head-mounted RG

nyu_franka_play_dataset

1,109 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 456, "total_frames": 44875, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

OmniRetarget_Dataset

1,100 downloads · mit

OmniRetarget Dataset: Humanoid Loco-Manipulation & Scene Interaction Paper | Project Page This dataset contains motion trajectories of a G1 humanoid robot interacting with objects and complex terrains

G1_Dex1_DualRobot_Clean_Table

1,098 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

NitroGen

1,096 downloads · cc-by-nc-4.0

Website | Model | Dataset | Code | Paper NitroGen Dataset Dataset Description: The NitroGen dataset contains action annotations for publicly available gameplay videos. Specifically, we used an in-hous

VLA_Instruction_Tuning

1,095 downloads · not specified

This repository contains the VLA-IT dataset, a curated 650K-sample Vision-Language-Action Instruction Tuning dataset, and the SimplerEnv-Instruct benchmark. These are presented in the paper InstructVL

bridge_data_v2_teleop

1,092 downloads · cc-by-4.0

BridgeData V2 Teleoperated Demos Teleoperated demonstrations in BridgeData V2. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~86% of all teleoperated episode

NIRANJAN_wire_hook_20260331

1,092 downloads · not specified

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 151, "total_frames": 89646, "total

  • Tabular
  • Timeseries
  • Parquet

libero_spatial_no_noops_abs

1,081 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 415, "total_frames": 50744, "total_tasks": 10, "total_

  • Tabular
  • Timeseries
  • Parquet

libero_10

1,076 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks

  • Tabular
  • Timeseries
  • Parquet

level12_rac_2_2026-02-09

1,066 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 537, "total_frames": 1835987, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

aloha_pen_uncap_diverse

1,059 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description This dataset is a lerobot conversion of the aloha_pen_uncap_diverse subset of BiPlay. BiPlay contains 9.7 hours of bimanual data collected w

  • Image
  • Timeseries
  • Parquet

PhysicalAI-Robotics-GraspGen

1,056 downloads · cc-by-4.0

GraspGen: Scaling Sim2Real Grasping GraspGen is a large-scale simulated grasp dataset for multiple robot embodiments and grippers. We release over 57 million grasps, computed for a subset of 8515 obje

  • Text
  • Text

berkeley_cable_routing_lerobot

1,046 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1482, "total_frames": 38240, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

RoboTwin_put_bottles_dustbin_randomized

1,043 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 297520, "total_tasks": 500, "chunk

  • Image
  • Timeseries
  • Parquet

austin_sirius_dataset_lerobot

1,040 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_

  • Tabular
  • Timeseries
  • Parquet

berkeley_gnm_recon

1,039 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 11834, "total_frames": 610907, "total_tasks": 1, "chu

  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pick-simple

1,030 downloads · cc-by-4.0

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation

  • Tabular
  • Timeseries
  • Parquet

stanford_hydra_dataset_lerobot

1,028 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_

  • Tabular
  • Timeseries
  • Parquet

libero_cot

1,016 downloads · apache-2.0

This dataset was created using LeRobot. It contains embodied Chain-of-Thought (CoT) demonstrations for the LIBERO benchmark, featuring paired reasoning and action traces. It was curated as part of the

  • Image
  • Timeseries
  • Parquet

360DVO

1,013 downloads · cc-by-nc-sa-4.0

360DVO Dataset This is the official dataset from the paper 360DVO: Deep Visual Odometry for Monocular 360-Degree Camera, which is published on IEEE Robotics and Automation Letters (RA-L) 2026. Project

  • Image
  • Imagefolder

RoboTwin_clean

1,012 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 2500, "total_frames": 549787, "total_tasks": 2409, "chu

  • Image
  • Timeseries
  • Parquet

G1_WBT_Brainco_Make_The_Bed

1,002 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Bag_Insert

1,000 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Video

G1_WBT_Inspire_Pickup_Pillow_MainCamOnly

998 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Wipe_Table

989 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

cmu_play_fusion

985 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "tota

  • Tabular
  • Timeseries
  • Parquet

robocasa_target_human_unified

985 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "robocasa", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 50,

  • Tabular
  • Timeseries
  • Parquet

cmu_play_fusion_lerobot

980 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total

  • Tabular
  • Timeseries
  • Parquet

pick_cube_60mm

980 downloads · apache-2.0

Pick & Place Cube Dataset Collected for Agibot G2 robot

  • Tabular
  • Text
  • Parquet

SynthVerse

972 downloads · cc-by-nc-sa-4.0

Viewer is explicitly configured to read the parquet split only. Citation If you find this dataset useful, please cite: @article{zhao2026SythnVerse, title={SynthVerse: A Large-Scale Diverse Synthetic D

  • Image
  • Text
  • Optimized Parquet

isaaclab-ur5-insertion-stacked-cuboid-randomized

972 downloads · apache-2.0

This dataset was created with the IsaacLab-WireManagement scripted agent. Total time: 45 minutes 56.85 seconds to generate 506 episodes. Visualize this dataset Dataset Structure Total episodes: 506 To

  • Tabular
  • Timeseries
  • Parquet

berkeley_gnm_sac_son

957 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 2955, "total_frames": 241059, "total_tasks": 1, "chun

  • Tabular
  • Timeseries
  • Parquet

stanford_hydra_dataset

949 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total

  • Tabular
  • Timeseries
  • Parquet

GoPro-Raw-Videos

948 downloads · mit

Raw GoPro Videos for Four Robotic Manipulation Tasks [Project Page] [Paper] [Code] [Models] [Processed Dataset] This repository contains raw GoPro videos of robotic manipulation tasks collected in-the

  • Video

gr1_pickup_compact_h264

948 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 150, "total_frames": 7800, "total_tasks": 1, "chunks_size"

  • Tabular
  • Timeseries
  • Parquet

nyu_door_opening_surprising_effectiveness

947 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 484, "total_frames": 20405, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

wildos

945 downloads · apache-2.0

WildOS Frontiers Dataset Dataset Description This dataset provides visual frontier annotations for outdoor long-range navigation, created for WildOS: Open-Vocabulary Object Search in the Wild. The ann

  • Image
  • Imagefolder

record-test-realsense-depth-14

927 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 499, "total_tasks": 1, "chunks

  • Image
  • Tabular
  • Parquet

berkeley_rpt_lerobot

913 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_

  • Tabular
  • Timeseries
  • Parquet

GraspClutter6D

900 downloads · cc-by-sa-4.0

GraspClutter6D Dataset GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Rae

droid-1.0.1-preprocessed

899 downloads · apache-2.0

DROID 1.0.1 — Preprocessed Mirror A ready-to-use mirror of lerobot/droid_1.0.1 with two preprocessing fixes baked in: Fixed meta/episodes/*.parquet timestamps. The upstream release stores videos/*/fro

  • Video

ManiSkill2

894 downloads · apache-2.0

ManiSkill2 Data Update: ManiSkill 3 has been released https://github.com/haosulab/ManiSkill/. It uses different datasets than ManiSkill2 so the data here is not expected to transfer over ManiSkill2 is

syn_realDataset

893 downloads · mit

Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to

so101-sort-color-joint-angles-preprocessed

887 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 2, "total_frames": 3394, "total_tasks": 1, "total_videos":

  • Image
  • Tabular
  • Parquet

aloha_static_coffee

886 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

conq_hose_manipulation

880 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 139, "total_frames": 8277, "total_tasks": 3, "chunks_

  • Tabular
  • Timeseries
  • Parquet

MS-HAB-TidyHouse

875 downloads · mit

ManiSkill-HAB TidyHouse Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB TidyHouse. Dataset D

dk1-merge-2026-03

874 downloads · apache-2.0

DK-1 Merged Dataset This dataset was created using LeRobot. Dataset Description Merged and deduplicated DK-1 bimanual robot dataset. All source videos are re-encoded to 640x360 h264 with 14D joint-spa

  • Tabular
  • Timeseries
  • Parquet

BimanualUR5eExample

872 downloads · apache-2.0

Dataset Summary This dataset provides shards in the WebDataset format for fine-tuning RDT-2 or other policy models on bimanual manipulation. Each sample packs: a binocular RGB image (left + right wris

  • Image
  • Text
  • Webdataset

G1_Dex1_Pour_Medicine

867 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

fmb

859 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1804, "total_frames": 338188, "total_tasks": 24, "chu

  • Tabular
  • Timeseries
  • Parquet

so101-table-cleanup

850 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 1, "total_frames": 1358, "total_tasks": 1, "to

  • Tabular
  • Timeseries
  • Parquet

aloha_static_fork_pick_up

846 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 60000, "total_tasks": 1, "chunks

  • Tabular
  • Timeseries
  • Parquet

droid-cube-stacking-order

835 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Panda", "total_episodes": 108, "total_frames": 44097, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

piper_pickdrop_fooditems_101

831 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120066, "total_tasks": 1, "total_v

  • Video

MOSAIC_Dataset

826 downloads · cdla-permissive-2.0

MOSAIC Dataset Project Page | Paper | Code | Dataset | Model This repository releases the built-in MOSAIC multi-source motion dataset in the following paper: MOSAIC: Bridging the Sim-to-Real Gap in Ge

  • Document
  • Text
  • Text

Master_wire_bend_varying_3wire_20260401

811 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 290, "total_frames": 814263, "total_tasks": 7, "

  • Tabular
  • Timeseries
  • Parquet

test

809 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 356, "total_frames": 103656, "total_tasks": 16, "chunk

  • Tabular
  • Timeseries
  • Parquet

berkeley_mvp

805 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_

  • Tabular
  • Timeseries
  • Parquet

Robotwin2.0-lerobot

805 downloads · apache-2.0

RoboTwin2.0 - LeRobot v3.0 RoboTwin2.0 converted to LeRobot v3.0 format. Contains both joint-space and end-effector (EE) pose data for bimanual manipulation across 5 robot embodiments. Dataset Statist

austin_sailor_dataset_lerobot

803 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_

  • Tabular
  • Timeseries
  • Parquet

Master_wire_hook_screwing_20260408

803 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 235, "total_frames": 224240, "tota

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Organize_Tools

802 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_MountCameraRedGripper_Dataset

796 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Gripper", "total_episodes": 201, "total_frames": 172649, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

pick_cube_put_in_the_box_img3

789 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 100, "total_frames": 147066, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

xvla-soft-fold

788 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Repository: X-VLA License: Apache 2.0 Paper: Zheng et al., 2025, “X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Langu

  • Tabular
  • Timeseries
  • Parquet

ROVER

784 downloads · mit

The ROVER Visual SLAM Benchmark Paper News [2024/12/05] Initial code release. [2025/05/20] ROVER is accepted to IEEE Transactions on Robotics. [2025/05/25] Dataset released on HuggingFace, see Utility

refiner-utn-pick-cuboid-sim-v1

783 downloads · apache-2.0

Simulated Franka Pick-Cube Tactile Dataset The dataset was generated using the Robot Control Stack (RCS). RCS is a flexible Gymnasium wrapper-based robot control interface made for robot learning and

  • Image
  • Timeseries
  • Parquet

data

767 downloads · mit

Dataset Card for BEHAVIOR Robot Suite (BRS) Data This dataset provides robotic trajectories for five real-world household tasks. These tasks are: Clean house after a wild party; Clean the toilet; Take

viola_lerobot

763 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 135, "total_frames": 68913, "total_tasks": 3, "total_v

  • Tabular
  • Timeseries
  • Parquet

embodied_features_and_demos_libero

758 downloads · mit

Dataset for Embodied Chain-of-Thought Reasoning for LIBERO-90, as used by ECoT-Lite. TFDS Demonstration Data The TFDS dataset contains successful demonstration trajectories for LIBERO-90 (50 trajector

dronedata_new_lerobot

755 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "uav", "total_episodes": 799, "total_frames": 79900, "total_tasks": 2, "total_vide

  • Image
  • Timeseries
  • Parquet

PhysicalAI-Robotics-NuRec

751 downloads · cc-by-4.0

Dataset Description The Physical AI NuRec dataset seeks to empower robotic researchers to build the next generation of physical AI based end-to-end robotic models. This dataset includes various 3DGUT

  • 3d
  • Image

libero_merged

751 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk

  • Image
  • Timeseries
  • Optimized Parquet

colosseum-challenge

739 downloads · mit

Colosseum Dataset Card This dataset contains demonstrations for training and testing Imitation Learning based policies, taken from our simulation benchmark Colosseum, which is based on RLBench. The be

bin_pick_pack_coffee_capsules

739 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 200, "total_frames": 47865, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_DualArm_128x128

736 downloads · apache-2.0

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa

  • Tabular
  • Timeseries
  • Parquet

community_dataset_v1

731 downloads · apache-2.0

Community Dataset v1 (v3.0) A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 119 datasets from 52 contributors worldwide. This is a converted and cur

ucsd_pick_and_place_dataset

726 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1355, "total_frames": 67750, "total_tasks": 3, "total

  • Tabular
  • Timeseries
  • Parquet

act_table_to_tray

724 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 50, "total_frames": 52868, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

OceanGym

714 downloads · mit

🌊 OceanGym 🦾 A Benchmark Environment for Underwater Embodied Agents 🌐 Home Page 📄 ArXiv Paper 🤗 Hugging Face ☁️ Google Drive ☁️ Baidu Drive OceanGym is a high-fidelity embodied underwater environ

behavior1k-only-rgb

709 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t

  • Tabular
  • Timeseries
  • Parquet

cmu_stretch

707 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_

  • Tabular
  • Timeseries
  • Parquet

libero_spatial_no_noops_lerobot_v21

698 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "total_

  • Tabular
  • Timeseries
  • Parquet

aloha_static_candy

695 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

trlc_tshirt_folding

694 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 532, "total_frames": 1106053, "total_tasks":

  • Tabular
  • Timeseries
  • Parquet

humanoid-everyday

691 downloads · apache-2.0

Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti

  • Video

G1_WBT_Inspire_Put_Clothes_into_Washing_Machine_MainCamOnly

687 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

connect4_large-1-recut

686 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 108, "total_frames": 16645, "total_tasks": 1, "ch

  • Tabular
  • Timeseries
  • Parquet

Bright2026_19359

683 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19359 (炬亮启航) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-

G1_Dex3_BlockStacking_Dataset

679 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

libero_goal_image

678 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 428, "total_frames": 52042, "total_tasks": 10, "chunks_

  • Image
  • Timeseries
  • Parquet

IndustryShapes

675 downloads · mit

IndustryShapes Project Page | Paper IndustryShapes is a new benchmark dataset tailored for 6D object pose estimation in industrial settings. Targeting the challenges of textureless objects, reflective

  • Image
  • Text
  • Parquet

so101_pick_cube_chunked

672 downloads · apache-2.0

SO101 Pick Cube Dataset (Chunked) This is a restructured version of the gpudad/so101_pick_cube dataset with episode-level video files for faster data loading during training. Why Chunked? The original

  • Tabular
  • Video
  • Parquet

robocasa_target

671 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 12

  • Tabular
  • Video
  • Parquet

ucsd_kitchen_dataset

668 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 150, "total_frames": 3970, "total_tasks": 8, "total_v

  • Tabular
  • Timeseries
  • Parquet

utaustin_mutex

668 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "tot

  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-Robotics-Manipulation-Objects

668 downloads · not specified

PhysicalAI-Robotics-Manipulation-Objects is a dataset of automatic generated motions of robots performing operations such as picking and placing objects in a kitchen environment. The dataset was gener

  • Tabular
  • Timeseries
  • Parquet

behavior1k

662 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "R1Pro", "total_episodes": 2000, "total_frames": 21227314, "total_tasks": 10, "chu

  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_test

662 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,

  • Tabular
  • Text
  • Parquet

so101-puzzle-v4

658 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 37, "total_frames": 36234, "total_tasks": 1, "chu

  • Tabular
  • Timeseries
  • Parquet

Bright2026_19353

658 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19353 (模型学的全队) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (5

  • Video

onetwovla-dataset

655 downloads · not specified

Datasets for OneTwoVLA [Project Page] | [Paper] | [Code] This repository provides datasets collected with the UMI, converted into the LeRobot data format, along with synthetic vision-language data use

  • Image
  • Timeseries
  • Parquet

functional-manipulation-benchmark

654 downloads · cc-by-4.0

Functional Manipulation Benchmark This robot learning dataset is a part of the paper "FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning". It includes 22,550 expert demonstrat

aloha_static_battery

652 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 29400, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pick-simple_speed2x

650 downloads · cc-by-4.0

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation

  • Tabular
  • Timeseries
  • Parquet

berkeley_fanuc_manipulation

649 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total

  • Tabular
  • Timeseries
  • Parquet

Griffin

647 downloads · mit

Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark 📄 arXiv | 🐙 GitHub | 💾 Baidu Netdisk | 🤗 Hugging Face Griffin is the pioneering publicly available dataset for aeria

navitrace

645 downloads · cc-by-4.0

NaviTrace 🏠 Project 📄 Paper 💻 Code 🏆 Leaderboard NaviTrace is a novel VQA benchmark for VLMs that evaluates models on their embodiment-specific understanding of navigation across challenging real-

  • Image
  • Text
  • Parquet

umi_cup_in_the_wild

642 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1447, "total_frames": 699432, "total_tasks": 1, "chun

  • Tabular
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Vegetables_Into_Basket

641 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

Z1_Dual_Dex1_StackBox_Dataset_V2

638 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Dual", "total_episodes": 254, "total_frames": 178104, "total_tasks": 1

  • Tabular
  • Timeseries
  • Parquet

koch_pick_place_1_lego

636 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 39693, "total_tasks": 1, "chunks

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_Prepare_Fruit

634 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

utokyo_pr2_tabletop_manipulation

633 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 32708, "total_tasks": 3, "total_

  • Tabular
  • Timeseries
  • Parquet

thanos_picking_power_gem

633 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 51, "total_frames": 16267, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

libero_plus_rlds

631 downloads · mit

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website 🔥 Overview This repository contains the official implementation and benchmark for our paper

koch_pick_place_5_lego_random_pose

627 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 36013, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

taco_play_lerobot

625 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "tot

  • Video

dk1_duplo_in_box_one_2026-04-09

623 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks

  • Tabular
  • Timeseries
  • Parquet

nyu_franka_play_dataset_lerobot

622 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 365, "total_frames": 34448, "total_tasks": 1, "total_v

  • Tabular
  • Timeseries
  • Parquet

Arena-G1-Loco-Manipulation-Task

622 downloads · cc-by-4.0

Dataset Description: The Arena-G1-Loco-Manipulation-Task dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (G1) loco-manipulation task in IsaacLab-Arena en

  • Tabular
  • Timeseries
  • Parquet

dk1_2026-03-02

622 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 61, "total_frames": 120265, "total_tasks": 1,

  • Tabular
  • Timeseries
  • Parquet

UniHand_Preview

621 downloads · not specified

This data is a subset of the pretraining data for Being-H0.5. Citation Being-H0.5 @article{beingbeyond2026beingh05, title={Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment General

  • Text
  • Webdataset

aloha_static_vinh_cup

616 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 101, "total_frames": 45500, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

roboturk

613 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1995, "total_frames": 187507, "total_tasks": 3, "chun

  • Tabular
  • Timeseries
  • Parquet

so101-puzzle-v9

611 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 53, "total_frames": 47059, "total_tasks": 1, "chu

  • Tabular
  • Timeseries
  • Parquet

cmu_franka_exploration_dataset

610 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 199, "total_frames": 1990, "total_tasks": 3, "total_v

  • Tabular
  • Timeseries
  • Parquet

mimic_tictactoe_redx_full30hz

608 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 554, "total_frames": 428498, "total_tasks": 11

  • Tabular
  • Timeseries
  • Parquet

reachy-pick-and-place-images

597 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "reachy2", "total_episodes": 25, "total_frames": 6652, "total_tasks": 1, "chunks_s

  • Tabular
  • Timeseries
  • Parquet

GEMBench

596 downloads · mit

Dataset Card for GEMBench dataset 💎 GEneralizable vision-language robotic Manipulation Benchmark Dataset A benchmark to systematically evaluate generalization capabilities of vision-and-language robo

musclemimic-bimanual-retargeted

587 downloads · other

MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoBimanualArm musculoskeletal model, generated using General Motion Retargeting (GMR). Dataset Groups Group Motions Descr

libero_goal_no_noops_abs

586 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 413, "total_frames": 49801, "total_tasks": 10, "total_

  • Video

toto

583 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1003, "total_frames": 325699, "total_tasks":1, "total

  • Tabular
  • Timeseries
  • Parquet

minecraft-vla-stage1

583 downloads · mit

Minecraft VLA Stage 1: Action Pretraining Data Vision-Language-Action training data for Minecraft, processed from OpenAI's VPT contractor dataset. Dataset Description This dataset contains frame-actio

  • Text
  • Parquet

task_16_upload_1

580 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t

  • Tabular
  • Timeseries
  • Parquet

ArtVIP

575 downloads · apache-2.0

ArtVIP dataset card Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Modular interaction ✅ Physics fidelity ✅

level1_final_quality0

566 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 631, "total_frames": 1055899, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

aloha_static_screw_driver

564 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

aloha_static_towel

564 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

berkeley_gnm_cory_hall

564 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 7331, "total_frames": 156012, "total_tasks": 1, "chun

  • Tabular
  • Timeseries
  • Parquet

Master_wire_bend_varying_3wire_20260319

563 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "c

  • Tabular
  • Timeseries
  • Parquet

aloha_static_vinh_cup_left

562 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 100, "total_frames": 50000, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

so101_gba_direct_manipulation

562 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_leader", "total_episodes": 200, "total_frames": 47567, "total_tasks": 1, "c

  • Tabular
  • Timeseries
  • Parquet

imperialcollege_sawyer_wrist_cam

561 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 170, "total_frames": 7148, "total_tasks": 17, "chunks

  • Tabular
  • Timeseries
  • Parquet

reactive_diffusion_policy_dataset

561 downloads · mit

Dataset of Reactive Diffusion Policy Contents Description Structure Usage Tactile Dataset Description This is the raw and postprocessed dataset used in the paper Reactive Diffusion Policy: Slow-Fast V

  • Video

bridge_train_0_5000_augmented

555 downloads · cc-by-4.0

bridge_train_0_5000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,417 Splits: train: 0

  • Tabular
  • Text
  • Parquet

libero_10_subtasks

555 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "libero_panda", "total_episodes": 500, "total_frames": 138090, "total_tasks": 10,

  • Image
  • Text
  • Parquet

RBench

554 downloads · cc-by-4.0

Rethinking Video Generation Model for the Embodied World 🔍 Benchmark Overview The benchmark is constructed from two complementary perspectives: task categories and robot embodiment types, covering a

nemotron-vla-metaworld

549 downloads · mit

Nemotron-VLA MetaWorld Expert Demonstrations Expert demonstration dataset for training Vision-Language-Action (VLA) models on MetaWorld robot manipulation tasks. Built for the Nemotron-VLA project. Da

  • Text
  • Timeseries
  • Parquet

blueberry-kitchen

539 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "blueberry_ros", "total_episodes": 80, "total_frames": 29986, "total_tasks": 1, "c

  • Tabular
  • Timeseries
  • Parquet

Humanoid-X

536 downloads · not specified

Humanoid-X 🌐 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repo contains the officail dataset for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control" If yo

  • Text
  • Text

UrbanNav

533 downloads · mit

UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories Yanghong Mei*1,5, Yirong Yang*2, Longteng Guo†1, Qunbo Wang3, Ming-Ming Yu2, Xingjian He1, Wenjun Wu2,4, Jing Liu1

berkeley_mvp_lerobot

531 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_vid

  • Tabular
  • Timeseries
  • Parquet

columbia_cairlab_pusht_real

530 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 136, "total_frames": 27808, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

kaist_nonprehensile

529 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 201, "total_frames": 32429, "total_tasks": 193, "tota

  • Tabular
  • Timeseries
  • Parquet

nyu_door_opening_surprising_effectiveness_lerobot

528 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 435, "total_frames": 18196, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

aloha_static_battery_ep005_009

527 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

aloha_static_battery_ep000_004

524 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

metaworld_mt50_jpeg

523 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c

  • Image
  • Timeseries
  • Parquet

record-test-lab

523 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "cta_ur_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "tota

  • Tabular
  • Timeseries
  • Parquet

aera_semi_pnp_dr_02_04_2026_skip3_delta_no_go_home

509 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 337627, "total_tasks": 72, "chu

  • Image
  • Text
  • Parquet

dk1_duplo_in_box_2026-04-08

509 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks

  • Tabular
  • Timeseries
  • Parquet

insert-blender

507 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 100, "total_frames": 55127, "total_tasks": 10, "total_vide

  • Tabular
  • Timeseries
  • Parquet

aloha_static_tape

505 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

koch_bimanual_folding

502 downloads · not specified

This dataset was created using 🤗 LeRobot.

Bright2026_19159

500 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19159 (Robot大王) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (

aloha_mobile_chair

499 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 55, "total_frames": 110000, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

utokyo_xarm_pick_and_place

499 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 7490, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

level12_quality0_2026-02-08

499 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 258, "total_frames": 595888, "total_tasks":

  • Tabular
  • Timeseries
  • Parquet

aloha_static_ziploc_slide

497 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 56, "total_frames": 16800, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

aloha_static_pro_pencil

496 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 25, "total_frames": 8750, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_GraspOreo_Dataset

496 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 201, "total_frames": 234959, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

aloha_static_thread_velcro

493 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 34300, "total_tasks": 1, "chunks_

  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_ObjectPlacement_Dataset

491 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

sfp_dataset

488 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ur5e_aic", "total_episodes": 50, "total_frames": 91714, "total_tasks": 1, "chunks

  • Tabular
  • Timeseries
  • Parquet

aloha_mobile_wash_pan

486 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

metaworld_mt50

483 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c

  • Image
  • Timeseries
  • Parquet

G1_WBT_Inspire_Put_Drinks_Into_Fridge

481 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

eval_bi_so101_round1_4cam

479 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 601, "total_frames": 637163, "total_tasks": 1,

  • Tabular
  • Text
  • Parquet

so101_unplug_cable_4

478 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 10, "total_frames": 4539, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

coop_alldata

478 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 980, "total_frames": 106944, "total_tasks": 10, "total_

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_DualArm_256x256

476 downloads · apache-2.0

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa

  • Tabular
  • Timeseries
  • Parquet

berkeley_fanuc_manipulation_lerobot

474 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "fanuc_mate", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "to

  • Tabular
  • Timeseries
  • Parquet

GATE-VLAP-datasets

474 downloads · not specified

GATE-VLAP Datasets Grounded Action Trajectory Embeddings with Vision-Language Action Planning This repository contains preprocessed datasets from the LIBERO benchmark suite in WebDataset TAR format, s

  • Image
  • Text
  • Webdataset

hamburger_heti

472 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 598, "total_frames": 285571, "total_tasks": 9, "total_v

  • Tabular
  • Timeseries
  • Parquet

piper_pickdrop_fooditems_100

472 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120081, "total_tasks": 1, "total_v

  • Video

bi_red_tomato

471 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 304, "total_frames": 302309, "total_tasks": 1,

  • Tabular
  • Timeseries
  • Parquet

leisaac-pick-orange

468 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 60, "total_frames": 36293, "total_tasks": 1, "

  • Tabular
  • Timeseries
  • Parquet

community_dataset_v3

464 downloads · apache-2.0

Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets acro

pick_block

463 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 40, "total_frames": 22683, "total_tasks": 1, "chu

  • Tabular
  • Timeseries

av_aloha_sim_cube_transfer

462 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 200, "total_frames": 30000, "total_tasks": 1, "total_video

  • Tabular
  • Timeseries
  • Parquet

openarm_pick_v6

462 downloads · apache-2.0

OpenArm Pick v6 - LeRobot Dataset A LeRobot v2.1 dataset for bimanual robot manipulation, recorded in Isaac Sim with teleoperation. Dataset Description This dataset contains teleoperated demonstration

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_DiverseManip_SingleArm_128x128

458 downloads · apache-2.0

This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa

  • Tabular
  • Timeseries
  • Parquet

Z1_StackBox_Dataset

457 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Single", "total_episodes": 1265, "total_frames": 989335, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

dAgger_bin_pick_pack_coffee_capsules_1.5.0

456 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 80, "total_frames": 24266, "total_tasks": 1, "c

  • Tabular
  • Timeseries
  • Parquet

psi0-apple-to-plate-teleop

451 downloads · apache-2.0

Psi0 Apple-to-Plate VR Teleoperation Dataset Human-demonstrated loco-manipulation trajectories for fine-tuning the Psi0 Vision-Language-Action (VLA) model on a Unitree G1 humanoid robot in Isaac Lab s

  • Tabular
  • Video
  • Parquet

robomimic-pretrain-data

450 downloads · not specified

From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning (DICE-RL) This repository contains the datasets used in the paper From Prior to Pro: Efficient Skill Mastery via D

level2_final_quality3

450 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks

  • Tabular
  • Timeseries
  • Parquet

teleop-piper-0411

450 downloads · apache-2.0

teleop-piper-0411 Teleoperation dataset for the Agilex Piper robotic arm, collected April 11, 2026 across 3 rigs. Dual-camera (ego wrist + exo third-person), LeRobot v2 format. Overview Episodes 207 T

  • Video

sd

446 downloads · mit

Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to

metaworld_mt50

446 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c

  • Image
  • Timeseries
  • Parquet

Master_wire_bend_varying_3wire_20260327

445 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 221, "total_frames": 638395, "total_tasks": 7, "

  • Tabular
  • Timeseries
  • Parquet

droid_1.0.1_v30_compact

443 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596,

  • Tabular
  • Text
  • Parquet

mimic_ttt_redx_30hz_x2_ALL

443 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 338, "total_frames": 364063, "total_tasks": 11

  • Tabular
  • Timeseries
  • Parquet

pick_cube_put_in_the_box

443 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "t

  • Tabular
  • Timeseries
  • Parquet

libero_spatial_cot

442 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "total

  • Image
  • Timeseries
  • Parquet

Processed-Task-Dataset

441 downloads · not specified

Robotic Manipulation Datasets for Four Tasks [Project Page] [Paper] [Code] [Models] [Raw GoPro Videos] This repository contains in-the-wild robotic manipulation datasets collected using UMI, and proce

aloha_mobile_wipe_wine

433 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 65000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

aloha_sim_demos

432 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "aloha", "total_episodes": 10, "total_frames": 2568, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

hamburger

431 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 390, "total_frames": 219625, "total_tasks": 19, "total_

  • Tabular
  • Timeseries
  • Parquet

assemble_box_with_phone_stand_test

431 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_flexiv_rizon4_rt", "total_episodes": 35, "total_frames": 315631, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

human2locoman

426 downloads · mit

Human2LocoMan: Learning Versatile Quadrupedal Manipulation with Human Pretraining Yaru Niu1,* Yunzhe Zhang1,* Mingyang Yu1 Changyi Lin1 Chenhao Li1 Yikai Wang1 Yuxiang Yang2 Wenhao Yu2 Tingnan Zhang2

  • Video

CRCD

423 downloads · cc-by-4.0

Comprehensive Robotic Cholecystectomy Dataset (CRCD) The Comprehensive Robotic Cholecystectomy Dataset (CRCD) is a large-scale, multimodal dataset for robot-assisted surgery (RAS) research.It provides

  • Image
  • Text
  • Parquet

dexmimicgen_datasets

420 downloads · cc-by-nc-sa-4.0

DexMimicGen Datasets This repository contains the official dataset release of simulation environments and datasets for the ICRA 2025 paper "DexMimicGen: Automated Data Generation for Bimanual Dexterou

  • Image
  • Imagefolder

AgiBotWorld-Beta_G1_task_463_Clean_the_microwave_and_range_hood

419 downloads · not specified

agibot_task_463 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 清洁微波炉和抽油烟机。 total_episodes: 1780 total_tasks: 1 size: 57G

  • Video

AgiBotWorld-Beta_G1_task_537_make_the_bed

411 downloads · not specified

agibot_task_537 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 整理床铺 total_episodes: 751 total_tasks: 1 size: 64G Dataset

  • Video

furniture_bench_dataset_lerobot

411 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "chun

  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_GraspSquare_Dataset

410 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

1stpass10_splatsim_approach_lever_benchmark_1000_fast

410 downloads · apache-2.0

notes to self: this is using the faster teleop (aka no velocity limiting) This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lerobot

  • Image
  • Timeseries
  • Optimized Parquet

humanoidgen_dataset

409 downloads · not specified

HumanoidGen-Dataset 🔥 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repository provides datasets and resources for the HumanoidGen framework, enabling bimanual dexterous manipulation res

MS-HAB-SetTable

408 downloads · mit

ManiSkill-HAB SetTable Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB SetTable. Dataset Det

austin_buds_dataset

407 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 34112, "total_tasks": 1, "total_v

  • Tabular
  • Timeseries
  • Parquet

rbo_oxe_base_language_table_lerobot

407 downloads · apache-2.0

Language Table (LeRobot) — Task-Pruned, Reindexed Subset This release is a task-pruned subset of the original IPEC-COMMUNITY/language_table_lerobot. We subsampled by task text and rebuilt the package

humanoid-robots-training-dataset

407 downloads · cc-by-nc-4.0

Dynamic Intelligence — Humanoid Robot Training Dataset A first-person (egocentric) video dataset of human hand manipulation, designed for training humanoid robot policies via imitation learning. Each

  • Tabular
  • Text
  • Parquet

ur10e_real_cube_merge

406 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ros2", "total_episodes": 357, "total_frames": 334477, "total_tasks": 4, "chunks_s

  • Tabular
  • Timeseries
  • Parquet

dk1_duplo_in_box_two_2026-04-09

406 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks

  • Tabular
  • Timeseries
  • Parquet

bridge_train_20000_25460_augmented

404 downloads · cc-by-4.0

bridge_train_20000_25460_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,460 Frames: 185,428 Splits: tra

  • Tabular
  • Text
  • Parquet

ffw_sh5_dataset

403 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ffw_sh5", "total_episodes": 2, "total_frames": 184, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

aloha_static_pingpong_test

402 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 10, "total_frames": 6000, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

thermal_cycler_close-mujoco

400 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 106292, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

libero_jpeg

399 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk

  • Image
  • Timeseries
  • Parquet

EMMOE-100

397 downloads · apache-2.0

EMMOE-100 Trainset Resources Project Paper Code Model Dataset Dataset Feature Task Attributes Task Example Dataset Structure EMMOE-100/ ├── README.md ├── assets/ ├── data/ │ └── train/ │ ├── 1/ │ │ ├─

  • Image
  • Text
  • JSON

vla_demo

396 downloads · apache-2.0

This dataset is used for the demonstration of the Vision Language Action model finetuning. It is collected using a modified version of LeKiwi with 3 cameras, but technically, only the arm is used, so

  • Video

G1_WBT_Dex1_Put_Clothes_into_Washing_Machine

394 downloads · apache-2.0

Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the

  • Tabular
  • Timeseries
  • Parquet

rbm-1m-ood

390 downloads · apache-2.0

RBM-1M-OOD evaluation dataset used in Robometer. It contains over 1k trajectories used for evaluation of general-purpose reward models. Dataset Description Official evaluation in the paper uses only t

  • Text
  • Video

mimic_simpleClothFolding_v1

388 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 222, "total_frames": 384490, "total_tasks": 1,

  • Tabular
  • Timeseries
  • Parquet

utokyo_pr2_opening_fridge

387 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 80, "total_frames": 11522, "total_tasks": 1, "total_v

  • Tabular
  • Timeseries
  • Parquet

MoveBlock

386 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 2, "total_frames": 1731, "total_tasks":1, "total_videos

  • Tabular
  • Timeseries
  • Parquet

leisaac-pick-orange-prepared

386 downloads · apache-2.0

LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original? The original dataset

  • Video

so101_unplug_cable_3

383 downloads · not specified

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 49, "total_frames": 23490, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

merged_diffrenet_grips

383 downloads · apache-2.0

merged_diffrenet_grips Dataset for RECAP (RL with Experience and Corrections via Advantage-conditioned Policies) training. Features Feature Type Shape action float32 (14,) observation.state float32 (1

  • Tabular
  • Timeseries
  • Parquet

aloha_static_coffee_new

382 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 75000, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

utokyo_xarm_bimanual

382 downloads · cc-by-4.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 70, "total_frames": 1514, "total_tasks": 2, "total_vi

  • Tabular
  • Timeseries
  • Parquet

G1_Dex1_MountCamera_Dataset

381 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

austin_sailor_dataset

378 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total

  • Tabular
  • Timeseries
  • Parquet

svla_so100_stacking

378 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100", "total_episodes": 56, "total_frames": 22956, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

leisaac-pick-orange-prepared

375 downloads · apache-2.0

LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original? The original dataset

  • Video

RPD-maniskill

374 downloads · apache-2.0

Maniskill Sub-Dataset in RLDS Format used in RPD This repository contains the maniskill subset in RLDS format used to train Octo and OpenVLA in the paper Refined Policy Distillation: From VLA Generali

franka-fr3-manipulation

373 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 101, "total_frames": 21006, "total_tasks": 4, "chunks_

  • Image
  • Tabular
  • Parquet

Lightwheel-Tasks-X7S

372 downloads · apache-2.0

This dataset was created using LeRobot. 1 Dataset Description This dataset includes 117 Lightwheel-Libero-Tasks and 89 Lightwheel-Robocasa-Tasks, collected using the x7s robot in environments provided

  • Tabular
  • Timeseries
  • Parquet

retail_restock_11_feb_no_verb

372 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Unitree_G1_Dex3", "total_episodes": 124, "total_frames": 13344, "total_tasks": 3,

  • Image
  • Timeseries
  • Parquet

cmu_stretch_lerobot

370 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "

  • Tabular
  • Timeseries
  • Parquet

fold_combined_gt

370 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Data Distribution Overview This figure summarizes the data distribution of the ywxia/fold_combined_gt dataset, auto-generated after each con

  • Image
  • Timeseries
  • Parquet

berkeley_autolab_ur5_train_400_500_augmented

369 downloads · cc-by-4.0

berkeley_autolab_ur5_train_400_500 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,479 Videos: 900

  • Tabular
  • Text
  • Parquet

omx_f_omx_blue_block_yellow_basket_random

368 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "omx_f", "total_episodes": 99, "total_frames": 14858, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

Exylos_PickNPlace_Sample

368 downloads · cc-by-nc-4.0

Exylos Pick & Place Sample Dataset Summary Exylos Pick & Place Sample is a compact multi-view robotics dataset in LeRobot-style structure for a single manipulation task: pick up an object and place it

rl_pick_up_block_pi_rlt

366 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5", "total_episodes": 164, "total_frames": 34029, "total_tasks": 1, "chunks_si

  • Tabular
  • Timeseries
  • Parquet

briobin

364 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 51, "total_frames": 22577, "total_tasks": 1, "chu

  • Tabular
  • Timeseries
  • Parquet

merged_libero_mask_depth_noops_lerobot_10

363 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 200, "total_frames": 33587, "total_tasks": 40, "total_

  • Tabular
  • Timeseries
  • Parquet

logu_bimanual_joint_multi_task

363 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 212, "total_frames": 101703, "total_tasks": 2, "total_vide

  • Video

mimic_ttt_redBalanced_30hz

362 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 340, "total_frames": 289608, "total_tasks": 9,

  • Tabular
  • Timeseries
  • Parquet

bc_z_lerobot

360 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 1

  • Video

portafilter-regrasp-v6-slow-sticker-merged-remapped-clean

360 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 221, "total_frames": 203035, "total_tasks": 1

  • Tabular
  • Timeseries
  • Parquet

h0_post_train_db_2508

357 downloads · mit

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos We introduce Being-H0, the first dexterous Vision-Language-Action model pretrained from large-scale human videos via explicit

  • Image
  • Text
  • JSON

iamlab_cmu_pickup_insert_train_500_631_augmented

357 downloads · cc-by-4.0

iamlab_cmu_pickup_insert_train_500_631_augmented Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, sawyer, ur5e, widowX, xarm7 FPS: 20.0 Episodes: 131 Frames: 30,

  • Tabular
  • Text
  • Parquet

dk1_duplo_disassembly_2026-04-04

357 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Bimanual duplo disassembly demonstrations with DK1 robot Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: sepa

  • Tabular
  • Timeseries
  • Parquet

vla-poursing-seeds-v1

357 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 135, "total_frames": 65250, "total_tasks": 5, "ch

  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_train_300_400_augmented

355 downloads · cc-by-4.0

berkeley_autolab_ur5_train_300_400 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,934 Videos: 900

  • Tabular
  • Text
  • Parquet

Android-Control

355 downloads · apache-2.0

Android Control (Community Mirror) This is a community mirror of the official Android Control dataset by Google Research, hosted on Hugging Face for easier access. The original dataset is hosted on Go

bridge_test_0_3475_augmented

354 downloads · cc-by-4.0

bridge_test_0_3475_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 3,475 Frames: 118,603 Splits: train: 0:

  • Tabular
  • Text
  • Parquet

eval_real_so101_record_v1

354 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 751, "total_tasks": 1, "chunks

  • Tabular
  • Timeseries
  • Parquet

pp_fold_fix

352 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks":

  • Tabular
  • Timeseries
  • Parquet

Maniskill-Pushcube-demonstration-large

351 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 1860, "total_frames": 0, "total_tasks": 0, "total_videos":

  • Image
  • Timeseries
  • Parquet

bridge_train_15000_20000_augmented

351 downloads · cc-by-4.0

bridge_train_15000_20000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 169,127 Splits: tra

  • Tabular
  • Text
  • Parquet

Bright2026_19153

351 downloads · apache-2.0

Bright 2026 Benchmark - Robot 19153 (同济子豪兄 (张子豪团队)) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimens

  • Video

bridge_train_5000_10000_augmented

350 downloads · cc-by-4.0

bridge_train_5000_10000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 168,737 Splits: trai

  • Tabular
  • Text
  • Parquet

data_grapes

349 downloads · apache-2.0

FR5 Garlic Manipulation Dataset This dataset contains demonstrations collected on an FR5 robot teleoperated to manipulate garlic. Each episode pairs synchronized joint trajectories (observation.state,

  • Tabular
  • Video
  • Parquet

test_dataset

348 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "dianarm", "total_episodes": 26, "total_frames": 6847, "total_tasks": 1, "chunks_s

  • Tabular
  • Timeseries
  • Parquet

asu_table_top

347 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 110, "total_frames": 26113, "total_tasks": 216, "tota

  • Tabular
  • Timeseries
  • Parquet

Master_wire_bend_varying_3wire_20260319_fix

346 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "c

  • Tabular
  • Timeseries
  • Parquet

V2X-Radar

345 downloads · cc-by-nc-nd-4.0

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception Lei Yang · Xinyu Zhang · Jun Li · Chen Wang · Jiaqi Ma · Zhiying Song · Tong Zhao · Ziying Song · Li Wang · Mo Zhou · Yang She

soarm101_pickplace_multicolor_v1

345 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 30, "total_frames": 27724, "total_tasks": 2, "

  • Tabular
  • Timeseries
  • Parquet

pick

344 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 9, "total_frames": 289, "total_tasks": 2, "total_videos

  • Image
  • Timeseries
  • Parquet

RoboTwin_stack_bowls_three_randomized

343 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 223728, "total_tasks": 343, "chunk

  • Image
  • Timeseries
  • Parquet

Master_wire_hook_20260326

343 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 111, "total_frames": 62196, "total

  • Tabular
  • Timeseries
  • Parquet

something_something_v2_lerobot

342 downloads · apache-2.0

🤖 Custom LeRobot Dataset This dataset was created using LeRobot,and follows the LeRobot v2.1 dataset specification for robotic control and imitation learning. 📘 Dataset Description Codebase version:

threeBall_origin

342 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 101, "total_frames": 247024, "total_tasks": 2,

  • Tabular
  • Timeseries
  • Parquet

mimic_ttt_redx_ALLBAL_15hz_stable

342 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 18

  • Tabular
  • Timeseries
  • Parquet

ww_dataset_abs_joint_abs_jointnapkin_only

342 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ur5", "total_episodes": 100, "total_frames": 12696, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

bi_so101_baseline_4cam

340 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 303, "total_frames": 373511, "total_tasks": 1,

  • Tabular
  • Text
  • Parquet

dlr_sara_grid_clamp

339 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 107, "total_frames": 7622, "total_tasks": 1, "total_v

  • Tabular
  • Timeseries
  • Parquet

VLA_Arena_L0_S_lerobot_openpi

339 downloads · apache-2.0

VLA-Arena Dataset (L0 - Small Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and u

  • Image
  • Timeseries
  • Optimized Parquet

VLA_Arena_L1_M_lerobot_smolvla

339 downloads · apache-2.0

VLA-Arena Dataset (L1 - Medium Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and

  • Image
  • Timeseries
  • Optimized Parquet

full_ttt_redx_stable_fixed_short_labels

339 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,

  • Tabular
  • Timeseries
  • Parquet

redx_stable

338 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,

  • Tabular
  • Timeseries
  • Parquet

Master_wire_hook_20260327

337 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 128, "total_frames": 74668, "total

  • Tabular
  • Timeseries
  • Parquet

PhysicalAI-GR00T-Tuned-Tasks

335 downloads · cc-by-4.0

Dataset Description: This dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (GR1) tabletop manipulation tasks for industrial settings. Each dataset entry p

  • Tabular
  • Video
  • Parquet

bridge_train_10000_15000_augmented

335 downloads · cc-by-4.0

bridge_train_10000_15000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,583 Splits: tra

  • Tabular
  • Text
  • Parquet

MS-HAB-PrepareGroceries

334 downloads · mit

ManiSkill-HAB PrepareGroceries Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB PrepareGrocer

thermal_cycler_open-mujoco

333 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 84994, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Parquet

G1_Brainco_GraspRubiksCube_Dataset

333 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 197, "total_frames": 220788, "total_tasks"

  • Tabular
  • Timeseries
  • Parquet

2025-challenge-demos

331 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t

  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_train_800_896_augmented

330 downloads · cc-by-4.0

berkeley_autolab_ur5_train_800_896 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 96 Frames: 9,360 Videos: 864

  • Tabular
  • Text
  • Parquet

G1_Brainco_Toast_Dataset

329 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_

  • Tabular
  • Timeseries
  • Parquet

soarm101_pickplace_10d_240e_closed

326 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 240, "total_frames": 180027, "total_tasks": 1,

  • Tabular
  • Timeseries
  • Parquet

vlabench_unified

326 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 10977, "total_frames": 3114872, "total_tasks": 295, "c

  • Tabular
  • Timeseries
  • Parquet

justaddforce-data

325 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 130, "total_frames": 4435, "total_tasks": 36, "chunks_size

G1_Brainco_StackCups_Dataset

325 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_

  • Tabular
  • Timeseries
  • Parquet

bi-so101-insert-screw-271ep

325 downloads · apache-2.0

bi-so101-insert-screw-271ep Bimanual teleoperation dataset for the task "Insert the copper screw into the black sleeve", collected with SO-101 dual arms and LeRobot 0.5.0. Dataset Summary Field Value

so101_multi_task_v2

324 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 59, "total_frames": 20844, "total_tasks": 2, "

  • Tabular
  • Timeseries
  • Parquet

libero90_successes

324 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 272, "total_frames": 11334, "total_tasks": 32, "total_v

  • Image
  • Timeseries
  • Parquet

real_so101_record_v1

324 downloads · apache-2.0

real_so101_record_v1 A real-world LeRobot-format dataset collected on an SO101/SO100-style follower arm with three camera views. Summary This dataset contains robot demonstrations recorded with lerobo

  • Tabular
  • Timeseries
  • Parquet

build_block_tower

322 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "arx5", "total_episodes": 60, "total_frames": 41832, "total_tasks": 1, "total_vide

  • Tabular
  • Timeseries
  • Parquet

lerobot_cli_mobile_block_transfer_v2

322 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_mobile", "total_episodes": 48, "total_fr

  • Tabular
  • Timeseries
  • Parquet

berkeley_autolab_ur5_train_0_100_augmented

321 downloads · cc-by-4.0

berkeley_autolab_ur5_train_0_100 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,863 Videos: 900 C

  • Tabular
  • Text
  • Parquet

pickplace_r2

321 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 75051, "total_tasks": 1, "total_vi

  • Tabular
  • Timeseries
  • Parquet

full_ttt_redx_stable_fixed_labels

321 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,

  • Tabular
  • Timeseries
  • Parquet

dlr_sara_pour

320 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 12971, "total_tasks": 1, "total_

  • Tabular
  • Timeseries
  • Parquet

dlr_edan_shared_control

320 downloads · mit

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 104, "total_frames": 8928, "total_tasks": 14, "total_

  • Tabular
  • Timeseries
  • Parquet

G1_Dex3_CameraPackaging_Dataset

320 downloads · apache-2.0

This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as

  • Tabular
  • Timeseries
  • Parquet

pp_fold

320 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks":

  • Tabular
  • Timeseries
  • Parquet

droid_xxjd_7_canonical

320 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 80, "total_frames": 33909, "total_tasks": 26, "chunks_

  • Tabular
  • Text
  • Parquet

shawn-pickup2

320 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_stationary", "total_episodes": 1, "total

  • Tabular
  • Timeseries
  • Parquet

pick_up_single_colored_cube_into_tape_v7

320 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 226, "total_frames": 120729, "total_tasks": 3,

  • Tabular
  • Timeseries
  • Parquet

piper_piper_pickdrop_bottle_70

319 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 70, "total_frames": 42041, "total_tasks": 1, "total_vid

  • Tabular
  • Timeseries
  • Optimized Parquet

language_table_lerobot

318 downloads · apache-2.0

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605,

  • Tabular
  • Timeseries
  • Parquet

PND_Adam-U_pour-cup

317 downloads · cc-by-4.0

This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation

  • Tabular
  • Timeseries
  • Parquet

RESEARCH PATHS

Use this record as part of a broader dataset review

A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.

For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.

Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.

INTERNAL LINKS

Continue the buyer workflow

EXTERNAL REFERENCES

Source context to verify

TRUELABEL ROUTING

Found a candidate worth buying or benchmarking?

Turn the Hugging Face record into a buyer-ready request with sample QA, rights review, conversion proof, and deployment-fit checks.

Generate bounty spec