Indexed records
1,000 robotics-tagged dataset records.
ROBOTICS DATASET INDEX
A broad source-backed watchlist for physical AI teams comparing robotics datasets, licenses, formats, modalities, and ingestion risk across the Hugging Face Hub.
DIRECT ANSWER
This index tracks 1,000 Hugging Face records tagged for robotics. It is the breadth layer for truelabel: use it to discover candidates, then promote important records into deeper dataset profiles, comparisons, and custom collection specs.
1,000 robotics-tagged dataset records.
5,343,188 combined downloads across the indexed records.
May 1, 2026, 9:28 PM
BREADTH MAP
616 INDEXABLE RECORDS
Tier-A and Tier-B records only. The remaining 384 records are consolidated under author clusters.
616 of 616 datasets
350,718 downloads · cc-by-4.0
PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robo
346,008 downloads · apache-2.0
This dataset was created using LeRobot. DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset One of the biggest open-source dataset for robotics with 27.044,326 frames, 92,223 episodes, 31,308
304,654 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Franka", "total_episodes": 95600, "total_frames": 27612581, "total_tasks": 0, "to
299,775 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 92233, "total_frames": 27044326, "total_tasks": 31308,
200,677 downloads · cc-by-nc-sa-4.0
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling 🎉NEWS [2026.3.21] 🔥 OmniWorld-Game with Metric Scale is now released! Check out our latest model Pi3X (an enhanced
200,660 downloads · cc-by-sa-4.0
Boasting over 13,000 hours of cumulative data and 5 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. Update Notes:Stage 3 data upload completed. 13,00
188,007 downloads · other
⚠️ Important: If you have already submitted an access request but have not completed the required DocuSign agreement, your request will remain pending. Please complete signing and we will grant access
165,979 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605,
151,758 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,
126,607 downloads · cc-by-4.0
PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robo
96,222 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 5
78,972 downloads · cc-by-nc-sa-4.0
KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task
75,097 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t
68,418 downloads · not specified
Key Features 🔑 1 million+ trajectories from 100 robots, with a total duration of 2976.4 hours. 100+ real-world scenarios across 5 target domains. Cutting-edge hardware: visual tactile sensors / 6-DoF
63,389 downloads · cc-by-nc-4.0
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language
60,561 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,
54,264 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,
50,345 downloads · apache-2.0
This dataset is part of the RoboVerse project, as described in the paper RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning.
47,824 downloads · cc-by-nc-4.0
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language
47,595 downloads · mit
Cloth-Folding Dataset for X-VLA Paper This dataset contains 1,500 episodes of cloth folding, collected using Agilex's robotic arm. It was used in the X-VLA paper for cloth-folding tasks, showcasing a
47,447 downloads · cc-by-nc-sa-4.0
AgiBot World 2026 Real-World Embodied Intelligence Dataset Overview As robotics research advances into real-world scenarios, the demand for authentic, high-quality data has become increasingly urgent.
47,044 downloads · cc-by-sa-4.0
InternData-N1 InternData-N1 is a large-scale, unified vision-language navigation dataset consolidating multiple benchmarks into a standardized format. With over 240,000 trajectories across 3,000+ scen
46,810 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "kuka_iiwa", "total_episodes": 209880, "total_frames": 2455879, "total_tasks": 1,
44,824 downloads · cc-by-nc-4.0
ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research
40,682 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10. All datasets were taken fro
37,477 downloads · cc-by-4.0
Dataset Description: Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate AI autonomy models for surgical robotics and ultrasoun
37,008 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
36,451 downloads · not specified
InternData-A1 InternData-A1 is a hybrid synthetic-real manipulation dataset containing over 630k trajectories and 7,433 hours across 4 embodiments, 18 skills, 70 tasks, and 227 scenes, covering rigid,
33,980 downloads · cc-by-nc-4.0
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language
30,417 downloads · cc-by-4.0
PhysicalAI-Robotics-Manipulation-Kitchen-Demos We provide a 600 hours of human-teleoperated demonstrations across 316 different tasks, totalling 55k trajectories. The datasets are collected using Fran
30,062 downloads · apache-2.0
ManipArena Dataset Training dataset for ManipArena, a real-robot benchmark and competition for bimanual manipulation at the CVPR 2026 Embodied AI Workshop. This dataset provides rich multi-modal demon
28,902 downloads · cc-by-4.0
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a
28,380 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
27,922 downloads · apache-2.0
DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter DECO-50 is a bimanual dexterous manipulation dataset with tactile sensing, comprising
26,167 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
25,327 downloads · cc-by-nc-sa-4.0
InternData-A1 dataset taken from InternRobotics/InternData-A1, with the tarballs extracted and directory structure "transposed" so that the top-level subdirectories are the four embodiments. Two frank
25,323 downloads · mit
ViFailback Dataset: Real-World Robotic Manipulation Failure Dataset with Visual Symbol Guidance A real-world dataset for diagnosing, correcting, and learning from robotic manipulation failures via vis
25,031 downloads · apache-2.0
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti
24,673 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk
23,852 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks":1, "total_v
22,090 downloads · cc-by-4.0
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a
21,791 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,
21,084 downloads · cc-by-4.0
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a
20,763 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks
19,757 downloads · cc-by-4.0
Open X-Embodiment Dataset (unofficial) This is an unofficial Dataset Repo. This Repo is set up to make Open X-Embodiment Dataset (55 in 1) more accessible for people who love huggingface🤗. Open X-Emb
18,546 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 10770, "total_frames": 26748966, "total_tas
18,187 downloads · apache-2.0
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation Accepted by Robotics: Science and Systems (RSS) 2025. 📢 ANNOUNCEMENT | RoboMIND V2.0 Release We're excited t
16,949 downloads · cc-by-4.0
DROID Success Episodes Successful episodes in DROID-COMMUNITY. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~89% of all successful episodes. Ported from raw
16,673 downloads · not specified
InternScenes InternScenes is a large-scale interactive indoor scene dataset with realistic layouts. This dataset comprises approximately 40,000 diverse scenes and 1.96M 3D objects that cover 15 common
16,413 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_vid
15,868 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,
15,632 downloads · apache-2.0
ArtVIP dataset card 🎉🎉🎉 ArtVIP is accepted by ICLR 2026 Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Mo
15,605 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5688, "total_frames": 14129038, "total_task
15,331 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,
14,567 downloads · mit
3D Optical Flow DROID Dataset Processed DROID robotics dataset with optical flow and scene flow annotations. Dataset Structure Organized by lab, each trajectory in separate tar.gz archive: IPRL/IPRL+2
14,562 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
14,048 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
13,952 downloads · mit
📦 FreeTacman Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation [ICRA 2026] 🎯 Overview This dataset supports the paper FreeTacman: Robot-free Visuo-Tactile Data Collection
13,951 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5551, "total_frames": 13805156, "total_task
13,329 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks": 1, "total_
12,690 downloads · apache-2.0
ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research
12,422 downloads · apache-2.0
RoboChallenge-lerobot-merged Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset (
12,281 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
12,120 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "chunks_si
12,110 downloads · cc-by-nc-sa-4.0
KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task
12,097 downloads · not specified
⚠️Important Notice !!! Dear Users, The Alpha Dataset has been updated as follows: Frame Loss Data Removal: Several episodes with frame loss issues have been removed. For the complete list of removed e
10,991 downloads · apache-2.0
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti
10,455 downloads · apache-2.0
AgiBot-g1_picks_up_parts_b 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Robot Type: ruantong_a2d | Codebase Version: v2.1 End-Effector Type:
10,423 downloads · cc-by-4.0
GNN Constraint-Aware World Model Dataset (v3) Real robot episodes with per-frame constraint graphs, SAM2 segmentation masks + 256-D feature embeddings, full 3D depth bundles, and synchronized robot st
10,280 downloads · cc-by-nc-sa-4.0
RealSource World RealSource World is a large-scale real-world robotics manipulation dataset collected using RS-02 dual-arm humanoid robot. This dataset contains diverse long-horizon manipulation tasks
9,997 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5413, "total_frames": 13497374, "total_task
9,832 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_vid
9,767 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
9,744 downloads · bsd-3-clause
ThinkJEPA Proprocessed Data Cache Dataset Description This repository contains the released preprocessed cache used by ThinkJEPA: ThinkJEPA: Empowering Latent World Models with Large Vision-Language R
9,734 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5357, "total_frames": 13251592, "total_task
9,650 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
9,615 downloads · mit
The GrandTour Dataset A project brought to you by RSL - ETH Zurich. References • Contributing • Citation References Official dataset webpage: grand-tour.leggedrobotics.com Getting started & examples:
9,610 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
9,034 downloads · apache-2.0
RoboCerebra (LeRobot v2.1 Format) Dataset Description This dataset is a converted version of RoboCerebra, now formatted for compatibility with the LeRobot ecosystem (v2.1). While the official RoboCere
8,738 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c
8,213 downloads · apache-2.0
TL;DR of L2D, the world's largest self-driving dataset! Read more about L2D on the official Huggingface blog: LeRobot goes to driving school 90+ TeraBytes of multimodal data (5000+ hours of driving) f
8,119 downloads · apache-2.0
Sim1_Dataset LeRobot-format manipulation dataset for cloth folding / deformable object tasks. Format This repository follows a LeRobot-style layout per subset: data/chunk-xxx/episode_XXXXXX.parquet: f
8,102 downloads · cc-by-nc-sa-4.0
LARY — A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment LARY is a unified evaluation framework for latent action representations. Given any model that pro
7,568 downloads · not specified
AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Mead
7,252 downloads · cc-by-4.0
Bridge-CoT Robot manipulation dataset with chain-of-thought annotations, derived from BridgeDataV2. Each sample pairs a scene image with a task description and structured VLM-generated annotations inc
7,053 downloads · apache-2.0
Agilex_Cobot_Magic_place_towel_flat Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Epis
6,931 downloads · other
Dynamic Object Manipulation (DOM) Project Page | Paper | Code TL;DR: DOM is a large-scale dynamic manipulation dataset with 200K episodes, 2,800+ scenes, and 206 objects for training and evaluating VL
6,916 downloads · cc-by-4.0
DROID Success (High-Quality Extrinsics) Subset of ~17k successful episodes in DROID-COMMUNITY filtered for high-quality camera extrinsics. Ported from raw 1.0.1 data at full resolution to LeRobotDatas
6,868 downloads · mit
Reconstructed GNM Trajectories GNM-format trajectories reconstructed from RECON, SCAND, GoStanford2, and other navigation datasets. Used as training data for the CAST safety pipeline and Neural CBF. S
6,845 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Description This dataset contains MolmoAct Dataset in lerobot format. All contents in this dataset were collected in-house by Ai2. Quick links: 📂 All M
6,566 downloads · mit
SandThink Dataset (v1.0) SandThink 是一个专为具身智能 (Embodied AI) 任务设计的大规模指令微调与偏好对齐数据集。该数据集通过结构化的 Chain-of-Thought (CoT) 推理过程,显著提升了 Vision-Language-Action (VLA) 模型在复杂环境下的任务拆解、路径规划和动作执行能力。 📊 数据集概览 (Dataset S
6,517 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,
6,440 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 8612, "total_frames": 1137459, "total_tasks": 24, "tot
6,384 downloads · mit
Comet-1.5k — RFT Dataset (Openpi-Comet) This repository releases Comet-1.5k, an RFT (Rejection Sampling Fine-Tuning) dataset generated by Team Comet’s post-training pipeline on the BEHAVIOR-1K simulat
6,361 downloads · apache-2.0
RoboChallenge-lerobot Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset in LeRob
6,311 downloads · apache-2.0
Agilex_Cobot_Magic_heat_burger Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Episodes:
6,044 downloads · apache-2.0
Cobot_Magic_make_hamburger 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Robot Type: agilex_cobot_decoupled_magic | Codebase Version: v2.1 End
5,993 downloads · mit
Edge Agent Reasoning WebSearch 260K Abstract The Edge-Agent-Reasoning-WebSearch-260K dataset is a massive, synthetically expert-engineered corpus of over 700 Million tokens, designed to train small, l
5,976 downloads · apache-2.0
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo
5,929 downloads · cc-by-4.0
MolmoAct - Pretraining Mixture Data Mixture used for MolmoAct Pretraining. Contains a subset of OXE formulated as Action Reasoning Data along with auxiliary robot data and link to Multimodal Web data.
5,830 downloads · apache-2.0
Agilex_Cobot_Magic_storage_towel Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot. Task Preview View Video Directly Overview Total Episode
5,814 downloads · apache-2.0
Wanderland Dataset Dataset Description Wanderland is a large-scale urban dataset designed for geometrically grounded simulation and open-world embodied AI research. The dataset contains diverse urban
5,746 downloads · cc-by-nc-4.0
PDD: Personalized Driving Dataset Dataset Description PDD (Personalized Driving Dataset) is a multi-driver, multi-scenario driving dataset collected in CARLA 0.9.15. It captures real human driving beh
5,705 downloads · cc-by-4.0
Dataset Description: PhysicalAI-Robotics-Manipulation-SingeArm is a collection of datasets of automatic generated motions of a Franka Panda robot performing operations such as block stacking, opening
5,682 downloads · not specified
RoboInter-Data: Intermediate Representation Annotations for Robot Manipulation Rich, dense, per-frame intermediate representation annotations for robot manipulation, built on top of DROID and RH20T. D
5,532 downloads · mit
Dataset Description This dataset contains egocentric human-humanoid data that can be used to co-train manipulation policy for humanoid robot. For paper, visualization, and example code using this data
5,403 downloads · not specified
R3D: Revisiting 3D Policy Learning Project Page | Paper | GitHub This repository contains the pre-processed data for R3D, a framework for robust and scalable 3D imitation learning. R3D identifies the
5,400 downloads · cc-by-nc-4.0
SAGE-3D InteriorGS USDZ: USDZ-Format 3D Gaussian Scenes for Isaac Sim Paper | Project Page | Code InteriorGS dataset converted to USDZ format for seamless integration with NVIDIA Omniverse and Isaac S
5,360 downloads · other
FastUMI Pro – Multimodal Sample Dataset Small-Scale Demonstration Data from the FastUMI Pro Multimodal Sensing System (Only Hundreds of Trajectories — Full Dataset Available Upon Request) Project Home
5,057 downloads · cc-by-nc-sa-4.0
WM-Eval Samples: EWMBench Generated Videos Pre-generated video samples for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Use this dataset as vi
5,037 downloads · apache-2.0
CostNav Teleop Dataset Dataset Summary The CostNav Teleop Dataset is a large-scale collection of human teleoperation recordings for robot navigation in an urban sidewalk simulation environment. It was
4,952 downloads · mit
LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving Project Page | Paper | Code Official CARLA dataset accompanies our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving
4,938 downloads · apache-2.0
📦 RoboCerebra HDF5 Dataset This repository contains the structured HDF5 dataset for the RoboCerebra project, optimized for high-performance robot learning training. ✅ Status: Data Upload Complete. Al
4,673 downloads · cc-by-4.0
NVIDIA Physical AI SimReady Warehouse OpenUSD Dataset Dataset Version: 1.1.0 Date: May 18, 2025 Author: NVIDIA, Corporation License: CC-BY-4.0 (Creative Commons Attribution 4.0 International) Contents
4,665 downloads · cc-by-4.0
RoboBench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain 📋 Overview RoboBench is a comprehensive evaluation benchmark designed to assess the capabilities
4,597 downloads · apache-2.0
SceneSmith Example Scenes Project Page | Paper | Code Example scenes generated by SceneSmith, a hierarchical agentic framework for constructing simulation-ready indoor environments from natural langua
4,534 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
4,529 downloads · apache-2.0
Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets acro
4,466 downloads · mit
Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Bridge dataset
4,453 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks":40, "chunks
4,442 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596,
4,422 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974,
4,357 downloads · apache-2.0
Community Dataset v1 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 128 datasets from 55 contributors worldwide. We used this dataset to pretrain S
4,302 downloads · apache-2.0
🤖 EO-Data-1.5M A Large-Scale Interleaved Vision-Text-Action Dataset for Embodied AI The first large-scale interleaved embodied dataset emphasizing temporal dynamics and causal dependencies among visi
4,300 downloads · cc-by-4.0
CARLA Stage 2 Pedestrian Dataset A large-scale driving dataset (Stage 2) focused on pedestrians, captured from CARLA simulator. This dataset contains RGB images and depth maps with camera parameters,
4,286 downloads · mit
Pick object and place in box Project KIWI | Qian Group HRI Lab | University of Houston Detail Value Task Pick object and place in box Episodes 50 FPS 30 Cameras Gripper (Arducam) + Overhead (Intel Rea
4,151 downloads · cc-by-4.0
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a
4,122 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
3,958 downloads · apache-2.0
PEEK VLM-Labeled BRIDGE_v2 dataset This dataset contains the LeRobot-format BRIDGE-v2 dataset with paths and masks from the PEEK VLM drawn onto the image: PEEK: Guiding and Minimal Image Representatio
3,906 downloads · cc-by-4.0
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a
3,902 downloads · cc-by-4.0
Object Pose Estimation Using Implicit Representation for Transparent Objects This dataset aggregates high quality 3D mesh assets and rendered data for training and fine-tuning pose estimation models.
3,873 downloads · mit
PhysProbe Dynamics Probing Dataset Manipulation episodes from Isaac Lab collected for probing physics understanding in video world models (V-JEPA 2, VideoMAE, DINOv2). Each episode includes dual-camer
3,832 downloads · cc-by-4.0
DROID Failure Episodes Failure episodes in DROID-COMMUNITY. Episodes with inconsistent data were discarded, leaving ~91% of all failure episodes. Ported from raw 1.0.1 data at full resolution to LeRob
3,669 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
3,630 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 3921, "total_frames": 569249, "total_tasks": 73, "tota
3,608 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
3,607 downloads · apache-2.0
Note This is an unofficial implementation. This dataset was converted from AgiBotWorld-Alpha into the LeRobot format using https://github.com/Tavish9/any4lerobot/tree/main/agibot2lerobot Configuration
3,565 downloads · cc-by-nc-4.0
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language
3,547 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,535 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 1
3,530 downloads · odc-by
UrbanVerse-100K Dataset [!NOTE] UrbanVerse-100K is a large-scale, physics-aware 3D asset and material database curated for urban simulation, physical and embodied AI research. It contains over 102K me
3,496 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,488 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,478 downloads · cc-by-nc-nd-4.0
Bench2Drive-Speed Project Page | Paper | GitHub Bench2Drive-Speed is a closed-loop benchmark for desired-speed conditioned autonomous driving, enabling explicit control over vehicle behavior through t
3,468 downloads · other
BONES-SEED: Skeletal Everyday Embodiment Dataset BONES-SEED (Skeletal Everyday Embodiment Dataset) is an open dataset of 142,220 annotated human motion animations for humanoid robotics. It provides mo
3,443 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
3,435 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,389 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,362 downloads · cc-by-4.0
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-sc
3,360 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 2366, "total_frames": 6205242, "total_tasks
3,339 downloads · cc-by-4.0
Retargeted AMASS Dataset for G1 This repository contains 100% of the AMASS dataset retargeted to the G1 humanoid and formatted for use with IsaacLab's AMP motion loader. It should also be compatible w
3,307 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 5
3,274 downloads · mit
The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments,
3,268 downloads · cc-by-nc-sa-4.0
WM-Eval GT: EWMBench Ground Truth Ground truth data for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Source Extracted from agibot-world/EWMBen
3,230 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 85, "total_frames": 127500, "total_tasks": 1, "chunks_s
3,216 downloads · apache-2.0
Community Dataset v2 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 340 datasets from 117 contributors worldwide. This dataset represents the secon
3,206 downloads · mit
DR_dataset Simulated robot manipulation dataset for imitation learning. Dataset Description This dataset contains 200 episodes of a Trossen WXAI robotic arm performing food transfer tasks in MuJoCo si
3,196 downloads · cc-by-nc-sa-4.0
We present the Oxford Spires Dataset, captured in and around well-known landmarks in Oxford using a custom-built multi-sensor perception unit as well as a millimetre-accurate map from a terrestrial Li
3,160 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "chunks_s
3,007 downloads · apache-2.0
Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Ti
3,000 downloads · apache-2.0
LeWAM Community Dataset A large-scale community-contributed robotics dataset for robot learning research featuring 321 datasets from the LeRobot community. Derived from Community Dataset v2, and conve
2,999 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AgiBot_A2D", "total_episodes": 28122, "total_frames": 47613574, "total_tasks": 30
2,955 downloads · apache-2.0
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 82092, "to
2,951 downloads · apache-2.0
Dataset Card for aloha_pen_uncap This dataset is a FiftyOne conversion in LeRobot format of the aloha_pen_uncap_diverse subset of BiPlay. The aloha_pen_uncap_diverse subset is a task-specific segment
2,937 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "frodobot", "total_episodes": 185761, "total_frames": 163360777, "total_tasks": 1,
2,887 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1600, "total_frames": 768897, "total_tasks": 116, "chun
2,849 downloads · apache-2.0
Towards Generalizable Robotic Manipulation in Dynamic Environments Heng Fang1, Shangru Li1, Shuhan Wang1, Xuanyang Xi2, Dingkang Liang1, Xiang Bai1 1 Huazhong University of Science and Technology, 2 H
2,838 downloads · cc-by-nc-sa-4.0
MarketGen : A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments 📑 arXiv | 🌐 Project | 🤗 Hugging Face MarketGen is a scalable simulation platform with automatic scen
2,811 downloads · apache-2.0
This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.
2,747 downloads · not specified
AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent TrajectoriesXing Han Lù, Amirhossein Kazemnejad*, Nicholas Mead
2,692 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "tota
2,661 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
2,655 downloads · mit
VLABench Primitive Tasks Dataset - LeRobot v3.0 Dataset Description This dataset is organized in the LeRobot v3.0 format and is used for integrating VLABench into the LeRobot framework officially. Com
2,650 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
2,644 downloads · apache-2.0
ManiSkill Demonstrations This dataset repo contains all of the latest ManiSkill demonstration datasets as well as some pretained model weights used to generate some demonstrations. To download by envi
2,643 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
2,626 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_
2,575 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 32212, "total_tasks": 47, "total
2,532 downloads · apache-2.0
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "total_episodes": 857, "total_frames": 85474, "total_videos": 2828, "codebase_version": "v2.1", "robot
2,515 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19229 (MVP) Competition task: 抓水果 (Fruit Grasping) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-d
2,494 downloads · cc-by-nc-4.0
SAGE-3D Collision Mesh: Physics-Enabled Collision Bodies for 3D Gaussian Scenes Paper | Project Page | Code High-precision collision geometry dataset extracted from 1,000 indoor Mesh scenes, enabling
2,452 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 91, "total_frames": 73578, "total_tasks": 1, "c
2,434 downloads · mit
Dataset Structure These datasets are used for MemoryVLA training. This is the standard LIBERO setting and can be directly used for other models as well.All data follow the RLDS format from the LIBERO
2,432 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Converted from TrossenMCAP format using the Trossen SDK trossen_mcap_to_lerobot_v2 tool. Dataset Structure meta/info.json: { "codebase_versi
2,423 downloads · cc-by-sa-4.0
from datasets import Features, Value, Video, load_dataset features = Features({ "head_color.mp4": Video(), "proprio_stats.hdf5": Value("binary"), "task.json": { "episode_id": Value("int64"), "init_sce
2,377 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "tot
2,347 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
2,275 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 5208, "total_frames": 1139911, "total_tasks": 7
2,236 downloads · cc-by-nc-4.0
D2E-480p Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI. 268.7 hours of sy
2,232 downloads · mit
Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well.All data follow the RLDS format from the Fractal datase
2,230 downloads · cc-by-nc-4.0
D2E-Original Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI. 273.4 hours o
2,197 downloads · mit
SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation This is the accompying dataset for the Social Navigation Scene Understanding Benchmark (SocialNav-SUB) which is a Vi
2,174 downloads · cc-by-4.0
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and a
2,147 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1647, "total_frames": 42328, "total_tasks": 1, "chunk
2,118 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5", "total_episodes": 896, "total_frames": 87783, "total_tasks": 5, "total_vide
2,091 downloads · apache-2.0
This is ScanNet dataset containing indoor scenes which is used for 3d object detection, 3d segmentation, etc. Acknowledgement @inproceedings{dai2017scannet, title={ScanNet: Richly-annotated 3D Reconst
2,075 downloads · cc-by-sa-4.0
Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments Description RoboSense is a large-scale multimodal dataset constructed to facil
2,059 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
2,046 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 638, "total_frames": 421684, "tota
2,011 downloads · bsd-2-clause
Dataset Overview The Amelia42-Mini dataset provides air traffic position reports for 42 major U.S. airports, including the following airports: KATL (Hartsfield-Jackson Atlanta International Airport) K
1,999 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
1,998 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 703, "total_frames": 456261, "tota
1,993 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1, "total_frames": 841, "total_tasks": 1, "
1,947 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description The English version of this dataset integrates 598 open-source community datasets into a single unified corpus, comprising 22,709 episodes a
1,913 downloads · mit
ATG-MoE Pressure-Reducing Valve Assembly Training Set This repository provides the training set used in our paper: "ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly s
1,903 downloads · cc-by-nc-4.0
COMMAND Dataset 😃 🤗 Dataset Availability The evaluation data from this dataset is now available on Hugging Face: Repository: maum-ai/COMMAND Content: 60 evaluation scenarios (240 files, ~38GB) File
1,870 downloads · mit
LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models LIBERO-Para is a controlled benchmark for evaluating the paraphrase robustness of Vision-Language-Action (VLA) m
1,858 downloads · cc-by-4.0
MolmoAct - Midtraining Mixture Data Mixture used for MolmoAct Midtraining. Contains MolmoAct Dataset formulated as Action Reasoning Data. MolmoAct is a fully open-source action reasoning model for rob
1,855 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
1,844 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_
1,842 downloads · not specified
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning 🔑Key Features MesaTask-10K, a large-scale dataset for task-oriented tabletop scene generation, comprises approximately
1,825 downloads · apache-2.0
SandGO Dataset (v1.0) SandGO 是一个专为具身智能 (Embodied AI) 和机器人学习设计的 long-horizon 多模态数据集。该数据集由原始 MarsMind_data 经过精炼、去冗余和结构化处理后得到,旨在支持 长序列决策、指令跟随 和 多模态模仿学习 等任务。数据集包含完整的状态‑动作序列、多视角 RGB 图像、LiDAR 点云、深度点云 以及对应的
1,791 downloads · cc-by-4.0
SuSuInterActs Dataset A Large-Scale Multimodal Dialogue Corpus with Synchronized Speech, Full-Body Motion, and Facial Expressions From the paper: SentiAvatar: Towards Expressive and Interactive Digita
1,779 downloads · apache-2.0
A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Ziyuan Xia • Jingyi Xu • Chong Cui • Yuanhong Yu• Jiazhao Zhang • Qingsong Yan • Tao Ni Junbo Chen • Xiaowei Zhou • Hujun Bao • Rui
1,773 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19382 (别碰我爪子) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56
1,770 downloads · cc-by-nc-nd-4.0
Dataset Card for Dataset Name MetaFold Dataset is a point-cloud trajectory dataset designed for multi-category garment folding tasks in robotic manipulation. Dataset Details Dataset Description Curate
1,761 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100_follower", "total_episodes": 50, "total_frames": 11939, "total_tasks": 1, "
1,753 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1183, "total_frames": 3102621, "total_tasks
1,747 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "sawyer", "total_episodes": 1796, "total_frames": 168423, "total_tasks": 3, "total
1,741 downloads · apache-2.0
This dataset was created using LeRobot. This is a release R2 (10K episodses) of yaak-ai/L2D in LeRobot Dataset V3 format. R3 release of 100K episode is now available here in LeRobotDataset V3 format.
1,653 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 141, "total_frames": 306425, "total_tasks": 1
1,652 downloads · apache-2.0
LeWAM Community Dataset (Preprocessed) A large-scale community-contributed robotics dataset for robot learning research featuring 318 datasets from the LeRobot community. Derived from Community Datase
1,644 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 2, "total_frames": 309, "total_tasks": 1, "chunks_
1,631 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks
1,628 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19380 (MAEN) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-
1,589 downloads · apache-2.0
VLA-Arena Dataset (L0 - Large Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and u
1,587 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 97939, "total_tasks": 5, "total
1,568 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19361 (DynamicX) Competition task: 环套在柱子上 (Put Ring onto Rod) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimen
1,563 downloads · mit
BiPlay contains 9.7 hours of bimanual data collected with an aloha robot at the RAIL lab @ UC Berkeley, USA. It contains 7023 clips, 2000 language annotations and 326 unique scenes. Paper: https://hug
1,541 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1085, "total_frames": 77965, "total_tasks": 89, "tota
1,539 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101499, "total_tasks": 10, "total_
1,537 downloads · apache-2.0
Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Ti
1,511 downloads · cc-by-4.0
Raw Data for HuMI [Project Page] | [Paper] Dataset Summary This dataset contains raw data collected using the HuMI data collection pipeline. It includes MP4 videos and tracker trajectories. The datase
1,489 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "tota
1,488 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks
1,477 downloads · cc-by-4.0
PhysicalAI Robotics Manipulation in the Kitchen Dataset Description: PhysicalAI-Robotics-Manipulation-Kitchen is a dataset of automatic generated motions of robots performing operations such as openin
1,477 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,
1,476 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,471 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 100000, "total_tasks": 1, "chun
1,464 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 3958, "total_frames": 574553, "total_tasks": 89, "total
1,451 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19234 (Apex) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-
1,450 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks
1,445 downloads · apache-2.0
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 79017, "to
1,440 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10. All datasets were taken fro
1,436 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1197, "total_frames": 3146294, "total_tasks
1,429 downloads · not specified
RobotInter-VQA: Intermediate Representation Understanding & Generation VQA Dataset for Manipulation English | 简体中文 A Visual Question Answering dataset for robotic manipulation, developed as part of th
1,427 downloads · mit
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website | 🤗 Assets | 🤗 Model | 📁 Training Dataset 🔥 Overview This repository contains the offici
1,419 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,417 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total
1,415 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3603, "total_frames": 237798, "total_tasks": 406, "to
1,395 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
1,385 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 5, "total_frames": 1153, "total_tasks": 1, "total_video
1,385 downloads · apache-2.0
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo
1,346 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 132, "total_frames": 273043, "total_tasks": 1
1,343 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,335 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "jaco_2", "total_episodes": 976, "total_frames": 70127, "total_tasks": 88, "total_
1,329 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 5124, "total_frames": 308918, "total_tasks": 389, "tota
1,321 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 50, "total_frames": 19631, "total_tasks": 1, "total_vid
1,316 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,301 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total
1,299 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 80, "total_frames": 70277, "total_tasks": 2, "
1,293 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "chunks_
1,290 downloads · mit
Active Perception AAWR Dataset in GRASP Lab Mock Kitchen Paper | Project page | Code Here is our full dataset (13.5GB) in DROID format: https://huggingface.co/datasets/Everloom/AAWR-DROID/ It includes
1,281 downloads · cc-by-4.0
SAGE-3D VLN Data: Vision-Language Navigation Dataset with Hierarchical Instructions Paper | Project Page | Code A comprehensive VLN dataset featuring 2 million trajectory-instruction pairs across 1,00
1,281 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 1345076, "total_tasks": 72, "ch
1,277 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,275 downloads · cc-by-sa-4.0
BAPData (EFM-10 Benchmark) Project Website | Paper BAPData is a dataset introduced in the paper "Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark
1,273 downloads · mit
The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments,
1,263 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 902, "total_frames": 294139, "total_tasks": 1, "total_
1,255 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "koch", "total_episodes": 51, "total_frames": 16602, "total_tasks": 1, "total_vide
1,247 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Converted BridgeV2 dataset to lerobot format. Downloaded from [https://rail.eecs.berkeley.edu/datasets/bridge_release/data/] (only demos_8_1
1,236 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 454, "total_frames": 66984, "total_tasks": 10, "chunks_
1,227 downloads · apache-2.0
Датасет создан при помощи библиотеки LeRobot. Описание датасета Русскоязычная версия данного датасета объединяет 598 открытых датасетов сообщества в единый унифицированный корпус, включающий 22 709 эп
1,220 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,219 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3000, "total_frames": 149985, "total_tasks": 1, "tota
1,211 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 18, "total_frames": 67500, "total_tasks": 1, "chunks_si
1,202 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_
1,192 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 631, "total_frames": 146241, "total_tasks": 7, "total_
1,179 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 4444, "total_frames": 930500, "total_tasks": 1424, "tot
1,179 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1706, "total_frames": 71623, "total_tasks": 201, "total
1,174 downloads · other
MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoFullBody musculoskeletal model, generated using General Motion Retargeting (GMR). Dataset Groups Group Motions Descript
1,159 downloads · apache-2.0
TraceSpatial-Bench: An Object-Centric 3D Trajectory Planning Benchmark Welcome to TraceSpatial-Bench, an object-centric 3D spatial trace planning benchmark provided by RoboTracer.TraceSpatial-Bench is
1,156 downloads · mit
SafeFlowMPC Finetuning Dataset This repository contains the finetuning dataset used in the paper SafeFlowMPC: Predictive and Safe Trajectory Planning for Robot Manipulators with Learning-based Policie
1,143 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
1,138 downloads · cc-by-nc-sa-4.0
TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks 🎯 Overview TAMEn builds upon the UMI paradigm with key enhancements in multimodality, precision-portabil
1,131 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 2460, "total_frames": 112980, "total_tasks": 9, "tota
1,123 downloads · apache-2.0
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobo
1,120 downloads · mit
VITRA Teleoperation Dataset Dataset Summary This dataset contains real-world robot teleoperation demonstrations collected using a 7-DoF robotic arm equipped with a dexterous hand and a head-mounted RG
1,109 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 456, "total_frames": 44875, "total_tasks": 1, "total_
1,100 downloads · mit
OmniRetarget Dataset: Humanoid Loco-Manipulation & Scene Interaction Paper | Project Page This dataset contains motion trajectories of a G1 humanoid robot interacting with objects and complex terrains
1,098 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
1,096 downloads · cc-by-nc-4.0
Website | Model | Dataset | Code | Paper NitroGen Dataset Dataset Description: The NitroGen dataset contains action annotations for publicly available gameplay videos. Specifically, we used an in-hous
1,095 downloads · not specified
This repository contains the VLA-IT dataset, a curated 650K-sample Vision-Language-Action Instruction Tuning dataset, and the SimplerEnv-Instruct benchmark. These are presented in the paper InstructVL
1,092 downloads · cc-by-4.0
BridgeData V2 Teleoperated Demos Teleoperated demonstrations in BridgeData V2. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~86% of all teleoperated episode
1,092 downloads · not specified
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 151, "total_frames": 89646, "total
1,081 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 415, "total_frames": 50744, "total_tasks": 10, "total_
1,076 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks
1,066 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 537, "total_frames": 1835987, "total_tasks"
1,059 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description This dataset is a lerobot conversion of the aloha_pen_uncap_diverse subset of BiPlay. BiPlay contains 9.7 hours of bimanual data collected w
1,056 downloads · cc-by-4.0
GraspGen: Scaling Sim2Real Grasping GraspGen is a large-scale simulated grasp dataset for multiple robot embodiments and grippers. We release over 57 million grasps, computed for a subset of 8515 obje
1,046 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1482, "total_frames": 38240, "total_tasks": 1, "total_
1,043 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 297520, "total_tasks": 500, "chunk
1,040 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_
1,039 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 11834, "total_frames": 610907, "total_tasks": 1, "chu
1,030 downloads · cc-by-4.0
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation
1,028 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_
1,016 downloads · apache-2.0
This dataset was created using LeRobot. It contains embodied Chain-of-Thought (CoT) demonstrations for the LIBERO benchmark, featuring paired reasoning and action traces. It was curated as part of the
1,013 downloads · cc-by-nc-sa-4.0
360DVO Dataset This is the official dataset from the paper 360DVO: Deep Visual Odometry for Monocular 360-Degree Camera, which is published on IEEE Robotics and Automation Letters (RA-L) 2026. Project
1,012 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 2500, "total_frames": 549787, "total_tasks": 2409, "chu
1,002 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
1,000 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
998 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
989 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
985 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "tota
985 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "robocasa", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 50,
980 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total
980 downloads · apache-2.0
Pick & Place Cube Dataset Collected for Agibot G2 robot
972 downloads · cc-by-nc-sa-4.0
Viewer is explicitly configured to read the parquet split only. Citation If you find this dataset useful, please cite: @article{zhao2026SythnVerse, title={SynthVerse: A Large-Scale Diverse Synthetic D
972 downloads · apache-2.0
This dataset was created with the IsaacLab-WireManagement scripted agent. Total time: 45 minutes 56.85 seconds to generate 506 episodes. Visualize this dataset Dataset Structure Total episodes: 506 To
957 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 2955, "total_frames": 241059, "total_tasks": 1, "chun
949 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total
948 downloads · mit
Raw GoPro Videos for Four Robotic Manipulation Tasks [Project Page] [Paper] [Code] [Models] [Processed Dataset] This repository contains raw GoPro videos of robotic manipulation tasks collected in-the
948 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 150, "total_frames": 7800, "total_tasks": 1, "chunks_size"
947 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 484, "total_frames": 20405, "total_tasks": 1, "total_
945 downloads · apache-2.0
WildOS Frontiers Dataset Dataset Description This dataset provides visual frontier annotations for outdoor long-range navigation, created for WildOS: Open-Vocabulary Object Search in the Wild. The ann
927 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 499, "total_tasks": 1, "chunks
913 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_
900 downloads · cc-by-sa-4.0
GraspClutter6D Dataset GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Rae
899 downloads · apache-2.0
DROID 1.0.1 — Preprocessed Mirror A ready-to-use mirror of lerobot/droid_1.0.1 with two preprocessing fixes baked in: Fixed meta/episodes/*.parquet timestamps. The upstream release stores videos/*/fro
894 downloads · apache-2.0
ManiSkill2 Data Update: ManiSkill 3 has been released https://github.com/haosulab/ManiSkill/. It uses different datasets than ManiSkill2 so the data here is not expected to transfer over ManiSkill2 is
893 downloads · mit
Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to
887 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 2, "total_frames": 3394, "total_tasks": 1, "total_videos":
886 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_vid
880 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 139, "total_frames": 8277, "total_tasks": 3, "chunks_
875 downloads · mit
ManiSkill-HAB TidyHouse Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB TidyHouse. Dataset D
874 downloads · apache-2.0
DK-1 Merged Dataset This dataset was created using LeRobot. Dataset Description Merged and deduplicated DK-1 bimanual robot dataset. All source videos are re-encoded to 640x360 h264 with 14D joint-spa
872 downloads · apache-2.0
Dataset Summary This dataset provides shards in the WebDataset format for fine-tuning RDT-2 or other policy models on bimanual manipulation. Each sample packs: a binocular RGB image (left + right wris
867 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
859 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1804, "total_frames": 338188, "total_tasks": 24, "chu
850 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 1, "total_frames": 1358, "total_tasks": 1, "to
846 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 60000, "total_tasks": 1, "chunks
835 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Panda", "total_episodes": 108, "total_frames": 44097, "total_tasks": 1, "total_vi
831 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120066, "total_tasks": 1, "total_v
826 downloads · cdla-permissive-2.0
MOSAIC Dataset Project Page | Paper | Code | Dataset | Model This repository releases the built-in MOSAIC multi-source motion dataset in the following paper: MOSAIC: Bridging the Sim-to-Real Gap in Ge
811 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 290, "total_frames": 814263, "total_tasks": 7, "
809 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 356, "total_frames": 103656, "total_tasks": 16, "chunk
805 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_
805 downloads · apache-2.0
RoboTwin2.0 - LeRobot v3.0 RoboTwin2.0 converted to LeRobot v3.0 format. Contains both joint-space and end-effector (EE) pose data for bimanual manipulation across 5 robot embodiments. Dataset Statist
803 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_
803 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 235, "total_frames": 224240, "tota
802 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
796 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Gripper", "total_episodes": 201, "total_frames": 172649, "total_tasks"
789 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 100, "total_frames": 147066, "total_tasks"
788 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Repository: X-VLA License: Apache 2.0 Paper: Zheng et al., 2025, “X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Langu
784 downloads · mit
The ROVER Visual SLAM Benchmark Paper News [2024/12/05] Initial code release. [2025/05/20] ROVER is accepted to IEEE Transactions on Robotics. [2025/05/25] Dataset released on HuggingFace, see Utility
783 downloads · apache-2.0
Simulated Franka Pick-Cube Tactile Dataset The dataset was generated using the Robot Control Stack (RCS). RCS is a flexible Gymnasium wrapper-based robot control interface made for robot learning and
767 downloads · mit
Dataset Card for BEHAVIOR Robot Suite (BRS) Data This dataset provides robotic trajectories for five real-world household tasks. These tasks are: Clean house after a wild party; Clean the toilet; Take
763 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 135, "total_frames": 68913, "total_tasks": 3, "total_v
758 downloads · mit
Dataset for Embodied Chain-of-Thought Reasoning for LIBERO-90, as used by ECoT-Lite. TFDS Demonstration Data The TFDS dataset contains successful demonstration trajectories for LIBERO-90 (50 trajector
755 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "uav", "total_episodes": 799, "total_frames": 79900, "total_tasks": 2, "total_vide
751 downloads · cc-by-4.0
Dataset Description The Physical AI NuRec dataset seeks to empower robotic researchers to build the next generation of physical AI based end-to-end robotic models. This dataset includes various 3DGUT
751 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk
739 downloads · mit
Colosseum Dataset Card This dataset contains demonstrations for training and testing Imitation Learning based policies, taken from our simulation benchmark Colosseum, which is based on RLBench. The be
739 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 200, "total_frames": 47865, "total_tasks": 1, "
736 downloads · apache-2.0
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa
731 downloads · apache-2.0
Community Dataset v1 (v3.0) A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 119 datasets from 52 contributors worldwide. This is a converted and cur
726 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1355, "total_frames": 67750, "total_tasks": 3, "total
724 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 50, "total_frames": 52868, "total_tasks": 1, "
714 downloads · mit
🌊 OceanGym 🦾 A Benchmark Environment for Underwater Embodied Agents 🌐 Home Page 📄 ArXiv Paper 🤗 Hugging Face ☁️ Google Drive ☁️ Baidu Drive OceanGym is a high-fidelity embodied underwater environ
709 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t
707 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_
698 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "total_
695 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "total_vid
694 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 532, "total_frames": 1106053, "total_tasks":
691 downloads · apache-2.0
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world roboti
687 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
686 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 108, "total_frames": 16645, "total_tasks": 1, "ch
683 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19359 (炬亮启航) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (56-
679 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
678 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 428, "total_frames": 52042, "total_tasks": 10, "chunks_
675 downloads · mit
IndustryShapes Project Page | Paper IndustryShapes is a new benchmark dataset tailored for 6D object pose estimation in industrial settings. Targeting the challenges of textureless objects, reflective
672 downloads · apache-2.0
SO101 Pick Cube Dataset (Chunked) This is a restructured version of the gpudad/so101_pick_cube dataset with episode-level video files for faster data loading during training. Why Chunked? The original
671 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 12
668 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 150, "total_frames": 3970, "total_tasks": 8, "total_v
668 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "tot
668 downloads · not specified
PhysicalAI-Robotics-Manipulation-Objects is a dataset of automatic generated motions of robots performing operations such as picking and placing objects in a kitchen environment. The dataset was gener
662 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "R1Pro", "total_episodes": 2000, "total_frames": 21227314, "total_tasks": 10, "chu
662 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630,
658 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 37, "total_frames": 36234, "total_tasks": 1, "chu
658 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19353 (模型学的全队) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (5
655 downloads · not specified
Datasets for OneTwoVLA [Project Page] | [Paper] | [Code] This repository provides datasets collected with the UMI, converted into the LeRobot data format, along with synthetic vision-language data use
654 downloads · cc-by-4.0
Functional Manipulation Benchmark This robot learning dataset is a part of the paper "FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning". It includes 22,550 expert demonstrat
652 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 29400, "total_tasks": 1, "chunks_
650 downloads · cc-by-4.0
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation
649 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total
647 downloads · mit
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark 📄 arXiv | 🐙 GitHub | 💾 Baidu Netdisk | 🤗 Hugging Face Griffin is the pioneering publicly available dataset for aeria
645 downloads · cc-by-4.0
NaviTrace 🏠 Project 📄 Paper 💻 Code 🏆 Leaderboard NaviTrace is a novel VQA benchmark for VLMs that evaluates models on their embodiment-specific understanding of navigation across challenging real-
642 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1447, "total_frames": 699432, "total_tasks": 1, "chun
641 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
638 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Dual", "total_episodes": 254, "total_frames": 178104, "total_tasks": 1
636 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 39693, "total_tasks": 1, "chunks
634 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
633 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 32708, "total_tasks": 3, "total_
633 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 51, "total_frames": 16267, "total_tasks": 1, "
631 downloads · mit
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website 🔥 Overview This repository contains the official implementation and benchmark for our paper
627 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 36013, "total_tasks": 1, "chunks_
625 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "tot
623 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks
622 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 365, "total_frames": 34448, "total_tasks": 1, "total_v
622 downloads · cc-by-4.0
Dataset Description: The Arena-G1-Loco-Manipulation-Task dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (G1) loco-manipulation task in IsaacLab-Arena en
622 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 61, "total_frames": 120265, "total_tasks": 1,
621 downloads · not specified
This data is a subset of the pretraining data for Being-H0.5. Citation Being-H0.5 @article{beingbeyond2026beingh05, title={Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment General
616 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 101, "total_frames": 45500, "total_tasks": 1, "total_vi
613 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1995, "total_frames": 187507, "total_tasks": 3, "chun
611 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 53, "total_frames": 47059, "total_tasks": 1, "chu
610 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 199, "total_frames": 1990, "total_tasks": 3, "total_v
608 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 554, "total_frames": 428498, "total_tasks": 11
597 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "reachy2", "total_episodes": 25, "total_frames": 6652, "total_tasks": 1, "chunks_s
596 downloads · mit
Dataset Card for GEMBench dataset 💎 GEneralizable vision-language robotic Manipulation Benchmark Dataset A benchmark to systematically evaluate generalization capabilities of vision-and-language robo
587 downloads · other
MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoBimanualArm musculoskeletal model, generated using General Motion Retargeting (GMR). Dataset Groups Group Motions Descr
586 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 413, "total_frames": 49801, "total_tasks": 10, "total_
583 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1003, "total_frames": 325699, "total_tasks":1, "total
583 downloads · mit
Minecraft VLA Stage 1: Action Pretraining Data Vision-Language-Action training data for Minecraft, processed from OpenAI's VPT contractor dataset. Dataset Description This dataset contains frame-actio
580 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t
575 downloads · apache-2.0
ArtVIP dataset card Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Modular interaction ✅ Physics fidelity ✅
566 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 631, "total_frames": 1055899, "total_tasks"
564 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_vid
564 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "chunks_si
564 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 7331, "total_frames": 156012, "total_tasks": 1, "chun
563 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "c
562 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 100, "total_frames": 50000, "total_tasks": 1, "total_vi
562 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_leader", "total_episodes": 200, "total_frames": 47567, "total_tasks": 1, "c
561 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 170, "total_frames": 7148, "total_tasks": 17, "chunks
561 downloads · mit
Dataset of Reactive Diffusion Policy Contents Description Structure Usage Tactile Dataset Description This is the raw and postprocessed dataset used in the paper Reactive Diffusion Policy: Slow-Fast V
555 downloads · cc-by-4.0
bridge_train_0_5000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,417 Splits: train: 0
555 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "libero_panda", "total_episodes": 500, "total_frames": 138090, "total_tasks": 10,
554 downloads · cc-by-4.0
Rethinking Video Generation Model for the Embodied World 🔍 Benchmark Overview The benchmark is constructed from two complementary perspectives: task categories and robot embodiment types, covering a
549 downloads · mit
Nemotron-VLA MetaWorld Expert Demonstrations Expert demonstration dataset for training Vision-Language-Action (VLA) models on MetaWorld robot manipulation tasks. Built for the Nemotron-VLA project. Da
539 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "blueberry_ros", "total_episodes": 80, "total_frames": 29986, "total_tasks": 1, "c
536 downloads · not specified
Humanoid-X 🌐 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repo contains the officail dataset for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control" If yo
533 downloads · mit
UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories Yanghong Mei*1,5, Yirong Yang*2, Longteng Guo†1, Qunbo Wang3, Ming-Ming Yu2, Xingjian He1, Wenjun Wu2,4, Jing Liu1
531 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_vid
530 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 136, "total_frames": 27808, "total_tasks": 1, "total_
529 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 201, "total_frames": 32429, "total_tasks": 193, "tota
528 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 435, "total_frames": 18196, "total_tasks": 1, "
527 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_si
524 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_si
523 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c
523 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "cta_ur_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "tota
509 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 337627, "total_tasks": 72, "chu
509 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks
507 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 100, "total_frames": 55127, "total_tasks": 10, "total_vide
505 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "chunks_
502 downloads · not specified
This dataset was created using 🤗 LeRobot.
500 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19159 (Robot大王) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimensions (
499 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 55, "total_frames": 110000, "total_tasks": 1, "total_vi
499 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 7490, "total_tasks": 1, "chunks_
499 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 258, "total_frames": 595888, "total_tasks":
497 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 56, "total_frames": 16800, "total_tasks": 1, "total_vid
496 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 25, "total_frames": 8750, "total_tasks": 1, "total_vide
496 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 201, "total_frames": 234959, "total_tasks"
493 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 34300, "total_tasks": 1, "chunks_
491 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
488 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ur5e_aic", "total_episodes": 50, "total_frames": 91714, "total_tasks": 1, "chunks
486 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_vid
483 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c
481 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
479 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 601, "total_frames": 637163, "total_tasks": 1,
478 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 10, "total_frames": 4539, "total_tasks": 1, "total_vide
478 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 980, "total_frames": 106944, "total_tasks": 10, "total_
476 downloads · apache-2.0
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa
474 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "fanuc_mate", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "to
474 downloads · not specified
GATE-VLAP Datasets Grounded Action Trajectory Embeddings with Vision-Language Action Planning This repository contains preprocessed datasets from the LIBERO benchmark suite in WebDataset TAR format, s
472 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 598, "total_frames": 285571, "total_tasks": 9, "total_v
472 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120081, "total_tasks": 1, "total_v
471 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 304, "total_frames": 302309, "total_tasks": 1,
468 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 60, "total_frames": 36293, "total_tasks": 1, "
464 downloads · apache-2.0
Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets acro
463 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 40, "total_frames": 22683, "total_tasks": 1, "chu
462 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 200, "total_frames": 30000, "total_tasks": 1, "total_video
462 downloads · apache-2.0
OpenArm Pick v6 - LeRobot Dataset A LeRobot v2.1 dataset for bimanual robot manipulation, recorded in Isaac Sim with teleoperation. Dataset Description This dataset contains teleoperated demonstration
458 downloads · apache-2.0
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018]. If you wa
457 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Single", "total_episodes": 1265, "total_frames": 989335, "total_tasks"
456 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 80, "total_frames": 24266, "total_tasks": 1, "c
451 downloads · apache-2.0
Psi0 Apple-to-Plate VR Teleoperation Dataset Human-demonstrated loco-manipulation trajectories for fine-tuning the Psi0 Vision-Language-Action (VLA) model on a Unitree G1 humanoid robot in Isaac Lab s
450 downloads · not specified
From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning (DICE-RL) This repository contains the datasets used in the paper From Prior to Pro: Efficient Skill Mastery via D
450 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks
450 downloads · apache-2.0
teleop-piper-0411 Teleoperation dataset for the Agilex Piper robotic arm, collected April 11, 2026 across 3 rigs. Dual-camera (ego wrist + exo third-person), LeRobot v2 format. Overview Episodes 207 T
446 downloads · mit
Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to
446 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "c
445 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 221, "total_frames": 638395, "total_tasks": 7, "
443 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596,
443 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 338, "total_frames": 364063, "total_tasks": 11
443 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "t
442 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "total
441 downloads · not specified
Robotic Manipulation Datasets for Four Tasks [Project Page] [Paper] [Code] [Models] [Raw GoPro Videos] This repository contains in-the-wild robotic manipulation datasets collected using UMI, and proce
433 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 65000, "total_tasks": 1, "total_vid
432 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "aloha", "total_episodes": 10, "total_frames": 2568, "total_tasks": 1, "total_vide
431 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 390, "total_frames": 219625, "total_tasks": 19, "total_
431 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_flexiv_rizon4_rt", "total_episodes": 35, "total_frames": 315631, "total_tasks"
426 downloads · mit
Human2LocoMan: Learning Versatile Quadrupedal Manipulation with Human Pretraining Yaru Niu1,* Yunzhe Zhang1,* Mingyang Yu1 Changyi Lin1 Chenhao Li1 Yikai Wang1 Yuxiang Yang2 Wenhao Yu2 Tingnan Zhang2
423 downloads · cc-by-4.0
Comprehensive Robotic Cholecystectomy Dataset (CRCD) The Comprehensive Robotic Cholecystectomy Dataset (CRCD) is a large-scale, multimodal dataset for robot-assisted surgery (RAS) research.It provides
420 downloads · cc-by-nc-sa-4.0
DexMimicGen Datasets This repository contains the official dataset release of simulation environments and datasets for the ICRA 2025 paper "DexMimicGen: Automated Data Generation for Bimanual Dexterou
419 downloads · not specified
agibot_task_463 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 清洁微波炉和抽油烟机。 total_episodes: 1780 total_tasks: 1 size: 57G
411 downloads · not specified
agibot_task_537 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: 夹爪 task: 整理床铺 total_episodes: 751 total_tasks: 1 size: 64G Dataset
411 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "chun
410 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
410 downloads · apache-2.0
notes to self: this is using the faster teleop (aka no velocity limiting) This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lerobot
409 downloads · not specified
HumanoidGen-Dataset 🔥 Homepage | ⛁ Dataset | 🤗 Models | 📑 Paper | 💻 Code This repository provides datasets and resources for the HumanoidGen framework, enabling bimanual dexterous manipulation res
408 downloads · mit
ManiSkill-HAB SetTable Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB SetTable. Dataset Det
407 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 34112, "total_tasks": 1, "total_v
407 downloads · apache-2.0
Language Table (LeRobot) — Task-Pruned, Reindexed Subset This release is a task-pruned subset of the original IPEC-COMMUNITY/language_table_lerobot. We subsampled by task text and rebuilt the package
407 downloads · cc-by-nc-4.0
Dynamic Intelligence — Humanoid Robot Training Dataset A first-person (egocentric) video dataset of human hand manipulation, designed for training humanoid robot policies via imitation learning. Each
406 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ros2", "total_episodes": 357, "total_frames": 334477, "total_tasks": 4, "chunks_s
406 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks
404 downloads · cc-by-4.0
bridge_train_20000_25460_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,460 Frames: 185,428 Splits: tra
403 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ffw_sh5", "total_episodes": 2, "total_frames": 184, "total_tasks": 1, "total_vide
402 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 10, "total_frames": 6000, "total_tasks": 1, "total_vide
400 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 106292, "total_tasks": 1, "total_vi
399 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunk
397 downloads · apache-2.0
EMMOE-100 Trainset Resources Project Paper Code Model Dataset Dataset Feature Task Attributes Task Example Dataset Structure EMMOE-100/ ├── README.md ├── assets/ ├── data/ │ └── train/ │ ├── 1/ │ │ ├─
396 downloads · apache-2.0
This dataset is used for the demonstration of the Vision Language Action model finetuning. It is collected using a modified version of LeKiwi with 3 cameras, but technically, only the arm is used, so
394 downloads · apache-2.0
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the
390 downloads · apache-2.0
RBM-1M-OOD evaluation dataset used in Robometer. It contains over 1k trajectories used for evaluation of general-purpose reward models. Dataset Description Official evaluation in the paper uses only t
388 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 222, "total_frames": 384490, "total_tasks": 1,
387 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 80, "total_frames": 11522, "total_tasks": 1, "total_v
386 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 2, "total_frames": 1731, "total_tasks":1, "total_videos
386 downloads · apache-2.0
LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original? The original dataset
383 downloads · not specified
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 49, "total_frames": 23490, "total_tasks": 1, "total_vid
383 downloads · apache-2.0
merged_diffrenet_grips Dataset for RECAP (RL with Experience and Corrections via Advantage-conditioned Policies) training. Features Feature Type Shape action float32 (14,) observation.state float32 (1
382 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 75000, "total_tasks": 1, "total_vid
382 downloads · cc-by-4.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 70, "total_frames": 1514, "total_tasks": 2, "total_vi
381 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
378 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total
378 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100", "total_episodes": 56, "total_frames": 22956, "total_tasks": 1, "chunks_si
375 downloads · apache-2.0
LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original? The original dataset
374 downloads · apache-2.0
Maniskill Sub-Dataset in RLDS Format used in RPD This repository contains the maniskill subset in RLDS format used to train Octo and OpenVLA in the paper Refined Policy Distillation: From VLA Generali
373 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 101, "total_frames": 21006, "total_tasks": 4, "chunks_
372 downloads · apache-2.0
This dataset was created using LeRobot. 1 Dataset Description This dataset includes 117 Lightwheel-Libero-Tasks and 89 Lightwheel-Robocasa-Tasks, collected using the x7s robot in environments provided
372 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Unitree_G1_Dex3", "total_episodes": 124, "total_frames": 13344, "total_tasks": 3,
370 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "
370 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Data Distribution Overview This figure summarizes the data distribution of the ywxia/fold_combined_gt dataset, auto-generated after each con
369 downloads · cc-by-4.0
berkeley_autolab_ur5_train_400_500 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,479 Videos: 900
368 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "omx_f", "total_episodes": 99, "total_frames": 14858, "total_tasks": 1, "total_vid
368 downloads · cc-by-nc-4.0
Exylos Pick & Place Sample Dataset Summary Exylos Pick & Place Sample is a compact multi-view robotics dataset in LeRobot-style structure for a single manipulation task: pick up an object and place it
366 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5", "total_episodes": 164, "total_frames": 34029, "total_tasks": 1, "chunks_si
364 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 51, "total_frames": 22577, "total_tasks": 1, "chu
363 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 200, "total_frames": 33587, "total_tasks": 40, "total_
363 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 212, "total_frames": 101703, "total_tasks": 2, "total_vide
362 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 340, "total_frames": 289608, "total_tasks": 9,
360 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 1
360 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 221, "total_frames": 203035, "total_tasks": 1
357 downloads · mit
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos We introduce Being-H0, the first dexterous Vision-Language-Action model pretrained from large-scale human videos via explicit
357 downloads · cc-by-4.0
iamlab_cmu_pickup_insert_train_500_631_augmented Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, sawyer, ur5e, widowX, xarm7 FPS: 20.0 Episodes: 131 Frames: 30,
357 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Bimanual duplo disassembly demonstrations with DK1 robot Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: sepa
357 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 135, "total_frames": 65250, "total_tasks": 5, "ch
355 downloads · cc-by-4.0
berkeley_autolab_ur5_train_300_400 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,934 Videos: 900
355 downloads · apache-2.0
Android Control (Community Mirror) This is a community mirror of the official Android Control dataset by Google Research, hosted on Hugging Face for easier access. The original dataset is hosted on Go
354 downloads · cc-by-4.0
bridge_test_0_3475_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 3,475 Frames: 118,603 Splits: train: 0:
354 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 751, "total_tasks": 1, "chunks
352 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks":
351 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 1860, "total_frames": 0, "total_tasks": 0, "total_videos":
351 downloads · cc-by-4.0
bridge_train_15000_20000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 169,127 Splits: tra
351 downloads · apache-2.0
Bright 2026 Benchmark - Robot 19153 (同济子豪兄 (张子豪团队)) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist). Action / State Dimens
350 downloads · cc-by-4.0
bridge_train_5000_10000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 168,737 Splits: trai
349 downloads · apache-2.0
FR5 Garlic Manipulation Dataset This dataset contains demonstrations collected on an FR5 robot teleoperated to manipulate garlic. Each episode pairs synchronized joint trajectories (observation.state,
348 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "dianarm", "total_episodes": 26, "total_frames": 6847, "total_tasks": 1, "chunks_s
347 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 110, "total_frames": 26113, "total_tasks": 216, "tota
346 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "c
345 downloads · cc-by-nc-nd-4.0
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception Lei Yang · Xinyu Zhang · Jun Li · Chen Wang · Jiaqi Ma · Zhiying Song · Tong Zhao · Ziying Song · Li Wang · Mo Zhou · Yang She
345 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 30, "total_frames": 27724, "total_tasks": 2, "
344 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 9, "total_frames": 289, "total_tasks": 2, "total_videos
343 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 223728, "total_tasks": 343, "chunk
343 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 111, "total_frames": 62196, "total
342 downloads · apache-2.0
🤖 Custom LeRobot Dataset This dataset was created using LeRobot,and follows the LeRobot v2.1 dataset specification for robotic control and imitation learning. 📘 Dataset Description Codebase version:
342 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 101, "total_frames": 247024, "total_tasks": 2,
342 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 18
342 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ur5", "total_episodes": 100, "total_frames": 12696, "total_tasks": 1, "total_vide
340 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 303, "total_frames": 373511, "total_tasks": 1,
339 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 107, "total_frames": 7622, "total_tasks": 1, "total_v
339 downloads · apache-2.0
VLA-Arena Dataset (L0 - Small Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and u
339 downloads · apache-2.0
VLA-Arena Dataset (L1 - Medium Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models. It provides a complete and
339 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,
338 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,
337 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 128, "total_frames": 74668, "total
335 downloads · cc-by-4.0
Dataset Description: This dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (GR1) tabletop manipulation tasks for industrial settings. Each dataset entry p
335 downloads · cc-by-4.0
bridge_train_10000_15000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,583 Splits: tra
334 downloads · mit
ManiSkill-HAB PrepareGroceries Dataset Paper | Website | Code | Models | (Full) Dataset | Supplementary Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB PrepareGrocer
333 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 84994, "total_tasks": 1, "total_vid
333 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 197, "total_frames": 220788, "total_tasks"
331 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "t
330 downloads · cc-by-4.0
berkeley_autolab_ur5_train_800_896 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 96 Frames: 9,360 Videos: 864
329 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_
326 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 240, "total_frames": 180027, "total_tasks": 1,
326 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 10977, "total_frames": 3114872, "total_tasks": 295, "c
325 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 130, "total_frames": 4435, "total_tasks": 36, "chunks_size
325 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_
325 downloads · apache-2.0
bi-so101-insert-screw-271ep Bimanual teleoperation dataset for the task "Insert the copper screw into the black sleeve", collected with SO-101 dual arms and LeRobot 0.5.0. Dataset Summary Field Value
324 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 59, "total_frames": 20844, "total_tasks": 2, "
324 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 272, "total_frames": 11334, "total_tasks": 32, "total_v
324 downloads · apache-2.0
real_so101_record_v1 A real-world LeRobot-format dataset collected on an SO101/SO100-style follower arm with three camera views. Summary This dataset contains robot demonstrations recorded with lerobo
322 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "arx5", "total_episodes": 60, "total_frames": 41832, "total_tasks": 1, "total_vide
322 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_mobile", "total_episodes": 48, "total_fr
321 downloads · cc-by-4.0
berkeley_autolab_ur5_train_0_100 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,863 Videos: 900 C
321 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 75051, "total_tasks": 1, "total_vi
321 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9,
320 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 12971, "total_tasks": 1, "total_
320 downloads · mit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 104, "total_frames": 8928, "total_tasks": 14, "total_
320 downloads · apache-2.0
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as
320 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks":
320 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 80, "total_frames": 33909, "total_tasks": 26, "chunks_
320 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_stationary", "total_episodes": 1, "total
320 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 226, "total_frames": 120729, "total_tasks": 3,
319 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 70, "total_frames": 42041, "total_tasks": 1, "total_vid
318 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605,
317 downloads · cc-by-4.0
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Citation
RESEARCH PATHS
A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.
For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.
Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.
INTERNAL LINKS
Use the catalog to compare source-backed dataset profiles by modality, task, rights signal, consent risk, and deployment fit.
Track source updates, licensing notes, and buyer-readiness changes that should trigger a renewed review.
Score whether a public source is enough for the model, rights path, modalities, and target environment.
Separate source license language from contributor consent, redistribution, private-space risk, and model-use assumptions.
Turn a public-source gap into a scoped capture request with sample QA, metadata, and delivery requirements.
Compare data providers when the answer is not another public dataset but a better sourcing or capture route.
Use the company index to separate annotation vendors, data engines, marketplaces, and specialist capture teams.
EXTERNAL REFERENCES
Market context for why physical AI systems need custom, enriched, real-world data beyond generic labeling workflows.
Robotics dataset and tooling context for Hugging Face based collection, sharing, conversion, and training workflows.
A cross-embodiment robotics dataset reference for comparing trajectory scale, robot diversity, and VLA training assumptions.
A large in-the-wild robot manipulation dataset reference for real-world trajectory capture and deployment transfer risk.
TRUELABEL ROUTING
Turn the Hugging Face record into a buyer-ready request with sample QA, rights review, conversion proof, and deployment-fit checks.