Indexed records
1,000 robotics-tagged dataset records.
ROBOTICS DATASET INDEX
A broad source-backed watchlist for physical AI teams comparing robotics datasets, licenses, formats, modalities, and ingestion risk across the Hugging Face Hub.
DIRECT ANSWER
This index tracks 1,000 Hugging Face records tagged for robotics. It is the breadth layer for truelabel: use it to discover candidates, then promote important records into deeper dataset profiles, comparisons, and custom collection specs.
5,343,188 combined downloads across the indexed records.
May 1, 2026, 9:28 PM
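Each record in this index carries a metadata line of the form `Published <month> <year> · <license> · <author>`. As a minimal sketch of how a team might screen those lines for ingestion risk, the Python below parses that line format and groups license tags into coarse buckets. The bucket names, the `BUCKETS` mapping, and the helper functions are assumptions made for illustration; they are not a classification defined by this index and are not legal advice.

```python
from collections import Counter

# Hypothetical risk buckets (an assumption for illustration only).
BUCKETS = {
    "apache-2.0": "permissive",
    "mit": "permissive",
    "bsd-3-clause": "permissive",
    "cc-by-4.0": "attribution",
    "cc-by-sa-4.0": "share-alike",
    "cc-by-nc-4.0": "non-commercial",
    "cc-by-nc-sa-4.0": "non-commercial",
}


def parse_record_line(line):
    """Split 'Published <date> · <license> · <author>' into a dict."""
    parts = [part.strip() for part in line.split("·")]
    if len(parts) != 3 or not parts[0].startswith("Published "):
        raise ValueError(f"unexpected metadata line: {line!r}")
    return {
        "published": parts[0][len("Published "):],
        "license": parts[1],
        "author": parts[2],
    }


def bucket_for(license_tag):
    """Map a license tag to a bucket; unknown tags fall back to manual review."""
    return BUCKETS.get(license_tag, "needs-review")


# Metadata lines copied from records in this index.
lines = [
    "Published Mar 2026 · cc-by-4.0 · nvidia",
    "Published Feb 2025 · apache-2.0 · cadene",
    "Published Apr 2026 · cc-by-nc-sa-4.0 · InternRobotics",
    "Published Apr 2026 · other · ropedia-ai",
    "Published Oct 2025 · not specified · agibot-world",
]

records = [parse_record_line(line) for line in lines]
counts = Counter(bucket_for(record["license"]) for record in records)

for record in records:
    print(record["author"], record["license"], bucket_for(record["license"]))
```

Tags such as `other` and `not specified` deliberately fall into `needs-review`, since the listing alone does not say what terms apply to those records.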
BREADTH MAP
616 ROBOTICS RECORDS WORTH OPENING
Tier-A and Tier-B records only. The remaining 384 records are consolidated under author clusters.
Published Mar 2026 · cc-by-4.0 · nvidia
PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robot embodiments and tasks.
Published Feb 2025 · apache-2.0 · cadene
This dataset was created using LeRobot. DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset One of the biggest open-source datasets for robotics, with 27,044,326 frames, 92,223 episodes, and 31,308 unique task descriptions in natural…
Published Mar 2025 · apache-2.0 · cadene
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Franka", "total_episodes": 95600, "total_frames": 27612581, "total_tasks": 0, "total_videos": 286800, "total_chunks":…
Published Apr 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 92233, "total_frames": 27044326, "total_tasks": 31308, "total_videos": 276699,…
Published Apr 2026 · cc-by-nc-sa-4.0 · InternRobotics
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling 🎉NEWS [2026.3.21] 🔥 OmniWorld-Game with Metric Scale is now released!
Published Apr 2026 · cc-by-sa-4.0 · genrobot2025
Boasting over 13,000 hours of cumulative data and 5 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. Update Notes: Stage 3 data upload completed.
Published Apr 2026 · other · ropedia-ai
⚠️ Important: If you have already submitted an access request but have not completed the required DocuSign agreement, your request will remain pending. Please complete signing and we will grant access once verified.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605, "total_videos": 442226, "total_chunks":…
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Mar 2026 · cc-by-4.0 · sini-21
PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robot embodiments and tasks.
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 599, "total_videos": 87212,…
Published Mar 2026 · cc-by-nc-sa-4.0 · balatubs123
KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task_A Single task Initial state: T-shirts…
Published Dec 2025 · mit · behavior-1k
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…
Published Oct 2025 · not specified · agibot-world
Key Features 🔑 1 million+ trajectories from 100 robots, with a total duration of 2976.4 hours. 100+ real-world scenarios across 5 target domains.
Published Mar 2026 · cc-by-nc-4.0 · rad1d1m123
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…
Published Jan 2026 · apache-2.0 · Saberlve
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Feb 2026 · apache-2.0 · arekborucki
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Nov 2025 · apache-2.0 · RoboVerseOrg
This dataset is part of the RoboVerse project, as described in the paper RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning.
Published Mar 2026 · cc-by-nc-4.0 · OpenMOSS-Team
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…
Published Nov 2025 · mit · Facebear
Cloth-Folding Dataset for X-VLA Paper This dataset contains 1,500 episodes of cloth folding, collected using Agilex's robotic arm.
Published Apr 2026 · cc-by-nc-sa-4.0 · agibot-world
AgiBot World 2026 Real-World Embodied Intelligence Dataset Overview As robotics research advances into real-world scenarios, the demand for authentic, high-quality data has become increasingly urgent.
Published Feb 2026 · cc-by-sa-4.0 · InternRobotics
InternData-N1 InternData-N1 is a large-scale, unified vision-language navigation dataset consolidating multiple benchmarks into a standardized format.
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "kuka_iiwa", "total_episodes": 209880, "total_frames": 2455879, "total_tasks": 1, "total_videos": 209880, "total_chunks":…
Published Apr 2026 · cc-by-nc-4.0 · ManiTwin
ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research.
Published Feb 2025 · cc-by-4.0 · physical-intelligence
This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10.
Published Apr 2026 · cc-by-4.0 · nvidia
Dataset Description: Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate AI autonomy models for surgical robotics and ultrasound.
Published Mar 2026 · cc-by-4.0 · REXX-NEW
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · not specified · InternRobotics
InternData-A1 InternData-A1 is a hybrid synthetic-real manipulation dataset containing over 630k trajectories and 7,433 hours across 4 embodiments, 18 skills, 70 tasks, and 227 scenes, covering rigid, articulated, deformable, and…
Published Apr 2026 · cc-by-nc-4.0 · hosam12kalad
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…
Published Mar 2026 · cc-by-4.0 · nvidia
PhysicalAI-Robotics-Manipulation-Kitchen-Demos We provide 600 hours of human-teleoperated demonstrations across 316 different tasks, totalling 55k trajectories.
Published Mar 2026 · apache-2.0 · ManipArena
ManipArena Dataset Training dataset for ManipArena, a real-robot benchmark and competition for bimanual manipulation at the CVPR 2026 Embodied AI Workshop.
Published Jun 2025 · cc-by-4.0 · nvidia
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.
Published Mar 2026 · cc-by-4.0 · markov-ai
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Feb 2026 · apache-2.0 · BAAI-Humanoid
DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter DECO-50 is a bimanual dexterous manipulation dataset with tactile sensing, comprising 50 hours of teleoperated data across 4…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…
Published Apr 2026 · cc-by-nc-sa-4.0 · yixuan-tan
InternData-A1 dataset taken from InternRobotics/InternData-A1, with the tarballs extracted and directory structure "transposed" so that the top-level subdirectories are the four embodiments.
Published Apr 2026 · mit · sii-rhos-ai
ViFailback Dataset: Real-World Robotic Manipulation Failure Dataset with Visual Symbol Guidance A real-world dataset for diagnosing, correcting, and learning from robotic manipulation failures via visual symbols.
Published Oct 2025 · apache-2.0 · USC-PSI-Lab
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.
Published Sep 2025 · apache-2.0 · HuggingFaceVLA
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": {…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks": 1, "total_videos": 206, "total_chunks": 1,…
Published Feb 2026 · cc-by-4.0 · kze193825
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.
Published May 2025 · apache-2.0 · Qu3tzal
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "WidowX", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Apr 2026 · cc-by-4.0 · fuzirui
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.
Published Feb 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks": 1, "chunks_size": 1000,…
Published Oct 2024 · cc-by-4.0 · jxu124
Open X-Embodiment Dataset (unofficial) This is an unofficial Dataset Repo. This Repo is set up to make Open X-Embodiment Dataset (55 in 1) more accessible for people who love huggingface🤗.
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 10770, "total_frames": 26748966, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · x-humanoid-robomind
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation Accepted by Robotics: Science and Systems (RSS) 2025.
Published Feb 2026 · cc-by-4.0 · jnogga
DROID Success Episodes Successful episodes in DROID-COMMUNITY. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~89% of all successful episodes.
Published Feb 2026 · not specified · InternRobotics
InternScenes InternScenes is a large-scale interactive indoor scene dataset with realistic layouts. This dataset comprises approximately 40,000 diverse scenes and 1.96M 3D objects that cover 15 common scene types and 288 object classes,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · X-Humanoid
ArtVIP dataset card 🎉🎉🎉 ArtVIP is accepted by ICLR 2026 Key Features ✅ 206 high-quality digital-twin articulated objects.
Published Feb 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5688, "total_frames": 14129038, "total_tasks": 1, "chunks_size": 1000,…
Published Jun 2025 · apache-2.0 · jesbu1
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Feb 2026 · mit · Salesforce
3D Optical Flow DROID Dataset Processed DROID robotics dataset with optical flow and scene flow annotations. Dataset Structure Organized by lab, each trajectory in separate tar.gz archive:…
Published Apr 2026 · cc-by-4.0 · helpechtox98988
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 50, "total_chunks": 1,…
Published Feb 2026 · mit · OpenDriveLab
📦 FreeTacman Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation [ICRA 2026] 🎯 Overview This dataset supports the paper FreeTacman: Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation.
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5551, "total_frames": 13805156, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 206, "total_frames": 25650, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Dhanush944
ManiTwin-100K: Manipulation-Ready Digital Object Twins Project Page | Paper ManiTwin-100K is a large-scale dataset of manipulation-ready digital object twins designed for robotic manipulation research.
Published Feb 2026 · apache-2.0 · Traly
RoboChallenge-lerobot-merged Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset (merged version) in LeRobot v3.0 format.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Mar 2026 · cc-by-nc-sa-4.0 · OpenDriveLab-org
KAI0. Contents About the Dataset Load the Dataset Download the Dataset Dataset Structure Folder hierarchy Details License and Citation About the Dataset ~134 hours real world scenarios Main Tasks Task_A Single task Initial state: T-shirts…
Published Sep 2025 · not specified · agibot-world
⚠️Important Notice !!! Dear Users, The Alpha Dataset has been updated as follows: Frame Loss Data Removal: Several episodes with frame loss issues have been removed.
Published Mar 2026 · apache-2.0 · FedorX8
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.
Published Dec 2025 · apache-2.0 · RoboCOIN
AgiBot-g1_picks_up_parts_b 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.
Published Apr 2026 · cc-by-4.0 · ChangChrisLiu
GNN Constraint-Aware World Model Dataset (v3) Real robot episodes with per-frame constraint graphs, SAM2 segmentation masks + 256-D feature embeddings, full 3D depth bundles, and synchronized robot states across two manipulation domains.
Published Feb 2026 · cc-by-nc-sa-4.0 · RealSourceData
RealSource World RealSource World is a large-scale real-world robotics manipulation dataset collected using RS-02 dual-arm humanoid robot.
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5413, "total_frames": 13497374, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Apr 2026 · bsd-3-clause · haichaozhang
ThinkJEPA Preprocessed Data Cache Dataset Description This repository contains the released preprocessed cache used by ThinkJEPA (ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model).
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 5357, "total_frames": 13251592, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Jan 2026 · mit · leggedrobotics
The GrandTour Dataset A project brought to you by RSL - ETH Zurich. References • Contributing • Citation References Official dataset webpage: grand-tour.leggedrobotics.com Getting started & examples:…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · CollisionCode
RoboCerebra (LeRobot v2.1 Format) Dataset Description This dataset is a converted version of RoboCerebra, now formatted for compatibility with the LeRobot ecosystem (v2.1).
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…
Published Sep 2025 · apache-2.0 · yaak-ai
TL;DR of L2D, the world's largest self-driving dataset! Read more about L2D on the official Huggingface blog: LeRobot goes to driving school 90+ TeraBytes of multimodal data (5000+ hours of driving) from 30 cities in Germany 6x surrounding…
Published Apr 2026 · apache-2.0 · InternRobotics
Sim1_Dataset LeRobot-format manipulation dataset for cloth folding / deformable object tasks. Format This repository follows a LeRobot-style layout per subset: data/chunk-xxx/episode_XXXXXX.parquet: frame-level trajectories…
Published Apr 2026 · cc-by-nc-sa-4.0 · meituan-longcat
LARY — A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment LARY is a unified evaluation framework for latent action representations.
Published Apr 2026 · not specified · yuanyyaa
AgentRewardBench 💾Code 📄Paper 🌐Website 🤗Dataset 💻Demo 🏆Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Xing Han Lù, Amirhossein Kazemnejad*, Nicholas Meade, Arkil Patel, Dongchan Shin,…
Published Apr 2026 · cc-by-4.0 · CliffKai
Bridge-CoT Robot manipulation dataset with chain-of-thought annotations, derived from BridgeDataV2. Each sample pairs a scene image with a task description and structured VLM-generated annotations including object detection, spatial…
Published Apr 2026 · apache-2.0 · RoboCOIN
Agilex_Cobot_Magic_place_towel_flat Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.
Published Apr 2026 · other · hzxie
Dynamic Object Manipulation (DOM) Project Page | Paper | Code TL;DR: DOM is a large-scale dynamic manipulation dataset with 200K episodes, 2,800+ scenes, and 206 objects for training and evaluating VLA models.
Published Feb 2026 · cc-by-4.0 · jnogga
DROID Success (High-Quality Extrinsics) Subset of ~17k successful episodes in DROID-COMMUNITY filtered for high-quality camera extrinsics.
Published Apr 2026 · mit · Sswa12
Reconstructed GNM Trajectories GNM-format trajectories reconstructed from RECON, SCAND, GoStanford2, and other navigation datasets. Used as training data for the CAST safety pipeline and Neural CBF.
Published Sep 2025 · cc-by-4.0 · allenai
This dataset was created using LeRobot. Dataset Description This dataset contains MolmoAct Dataset in lerobot format. All contents in this dataset were collected in-house by Ai2.
Published Mar 2026 · mit · XDUImageLab
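SandThink Dataset (v1.0) SandThink is a large-scale instruction-tuning and preference-alignment dataset designed for Embodied AI tasks. Through a structured Chain-of-Thought (CoT) reasoning process, it significantly improves the task decomposition, path planning, and action execution of Vision-Language-Action (VLA) models in complex environments. 📊 Dataset Summary The dataset contains three core components, totaling about 37,000 high-quality records:…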
Published Feb 2026 · apache-2.0 · DAVIAN-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…
Published Apr 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 8612, "total_frames": 1137459, "total_tasks": 24, "total_videos": 34448, "total_chunks": 9,…
Published Jan 2026 · mit · delinqu
Comet-1.5k: RFT Dataset (Openpi-Comet) This repository releases Comet-1.5k, an RFT (Rejection Sampling Fine-Tuning) dataset generated by Team Comet’s post-training pipeline on the BEHAVIOR-1K simulator (OmniGibson). It is intended to…
Published Feb 2026 · apache-2.0 · Traly
RoboChallenge-lerobot Note: The original dataset comes from RoboChallenge/Table30. This is an unofficial conversion to LeRobot v3.0 format. Dataset Description RoboChallenge benchmark dataset in LeRobot v3.0 format.
Published Apr 2026 · apache-2.0 · RoboCOIN
Agilex_Cobot_Magic_heat_burger Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.
Published Dec 2025 · apache-2.0 · RoboCOIN
Cobot_Magic_make_hamburger 📋 Overview This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.
Published Mar 2026 · mit · yatin-superintelligence
Edge Agent Reasoning WebSearch 260K Abstract The Edge-Agent-Reasoning-WebSearch-260K dataset is a massive, synthetically expert-engineered corpus of over 700 Million tokens, designed to train small, local models (SLMs) and edge-deployed…
Published Apr 2026 · apache-2.0 · Joocjun
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.
Published Sep 2025 · cc-by-4.0 · allenai
MolmoAct - Pretraining Mixture Data Mixture used for MolmoAct Pretraining. Contains a subset of OXE formulated as Action Reasoning Data along with auxiliary robot data and link to Multimodal Web data.
Published Apr 2026 · apache-2.0 · RoboCOIN
Agilex_Cobot_Magic_storage_towel Dataset Description This dataset uses an extended format based on LeRobot and is fully compatible with LeRobot.
Published Jan 2026 · apache-2.0 · ai4ce
Wanderland Dataset Dataset Description Wanderland is a large-scale urban dataset designed for geometrically grounded simulation and open-world embodied AI research.
Published Apr 2026 · cc-by-nc-4.0 · tasl-lab
PDD: Personalized Driving Dataset Dataset Description PDD (Personalized Driving Dataset) is a multi-driver, multi-scenario driving dataset collected in CARLA 0.9.15.
Published May 2025 · cc-by-4.0 · nvidia
Dataset Description: PhysicalAI-Robotics-Manipulation-SingeArm is a collection of datasets of automatically generated motions of a Franka Panda robot performing operations such as block stacking and opening cabinets and drawers.
Published Feb 2026 · not specified · InternRobotics
RoboInter-Data: Intermediate Representation Annotations for Robot Manipulation Rich, dense, per-frame intermediate representation annotations for robot manipulation, built on top of DROID and RH20T.
Published Apr 2025 · mit · RogerQi
Dataset Description This dataset contains egocentric human-humanoid data that can be used to co-train manipulation policies for humanoid robots.
Published Apr 2026 · not specified · eddie-cui
R3D: Revisiting 3D Policy Learning Project Page | Paper | GitHub This repository contains the pre-processed data for R3D, a framework for robust and scalable 3D imitation learning.
Published Dec 2025 · cc-by-nc-4.0 · spatialverse
SAGE-3D InteriorGS USDZ: USDZ-Format 3D Gaussian Scenes for Isaac Sim Paper | Project Page | Code InteriorGS dataset converted to USDZ format for seamless integration with NVIDIA Omniverse and Isaac Sim platforms.
Published Apr 2026 · other · LumosRobotics-FastUMIPro
FastUMI Pro – Multimodal Sample Dataset Small-Scale Demonstration Data from the FastUMI Pro Multimodal Sensing System (Only Hundreds of Trajectories — Full Dataset Available Upon Request) Project Homepage 📦 FastUMI Data Market 🔥FastUMI…
Published Apr 2026 · cc-by-nc-sa-4.0 · Physis-AI
WM-Eval Samples: EWMBench Generated Videos Pre-generated video samples for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework.
Published Apr 2026 · apache-2.0 · maum-ai
CostNav Teleop Dataset Dataset Summary The CostNav Teleop Dataset is a large-scale collection of human teleoperation recordings for robot navigation in an urban sidewalk simulation environment.
Published Apr 2026 · mit · ln2697
LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving Project Page | Paper | Code Official CARLA dataset accompanying our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving.
Published Apr 2026 · apache-2.0 · CollisionCode
📦 RoboCerebra HDF5 Dataset This repository contains the structured HDF5 dataset for the RoboCerebra project, optimized for high-performance robot learning training. ✅ Status: Data Upload Complete.
Published Dec 2025 · cc-by-4.0 · nvidia
NVIDIA Physical AI SimReady Warehouse OpenUSD Dataset Dataset Version: 1.1.0 Date: May 18, 2025 Author: NVIDIA, Corporation License: CC-BY-4.0 (Creative Commons Attribution 4.0 International) Contents This dataset includes the following:…
Published Oct 2025 · cc-by-4.0 · LeoFan01
RoboBench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain 📋 Overview RoboBench is a comprehensive evaluation benchmark designed to assess the capabilities of Multimodal Large Language Models…
Published Mar 2026 · apache-2.0 · nepfaff
SceneSmith Example Scenes Project Page | Paper | Code Example scenes generated by SceneSmith, a hierarchical agentic framework for constructing simulation-ready indoor environments from natural language prompts.
Published Apr 2026 · cc-by-4.0 · Sohai51515151
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Dec 2025 · apache-2.0 · HuggingFaceVLA
Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets across 46 robot types, enabling…
Published Nov 2025 · mit · shihao1895
Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well. All data follow the RLDS format from the Bridge dataset.
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "data_files_size_in_mb":…
Published Apr 2025 · apache-2.0 · cadene
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596, "chunks_size": 1000,…
Published Dec 2025 · apache-2.0 · ryanqian1994
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1999410, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Nov 2025 · apache-2.0 · HuggingFaceVLA
Community Dataset v1 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 128 datasets from 55 contributors worldwide. We used this dataset to pretrain SmolVLA.
Published Dec 2025 · apache-2.0 · IPEC-COMMUNITY
🤖 EO-Data-1.5M A Large-Scale Interleaved Vision-Text-Action Dataset for Embodied AI The first large-scale interleaved embodied dataset emphasizing temporal dynamics and causal dependencies among vision, language, and action modalities.
Published Feb 2026 · cc-by-4.0 · mkxdxd
CARLA Stage 2 Pedestrian Dataset A large-scale driving dataset (Stage 2) focused on pedestrians, captured from CARLA simulator.
Published Apr 2026 · mit · QianGroup
Pick object and place in box Project KIWI | Qian Group HRI Lab | University of Houston Task: Pick object and place in box · Episodes: 50 · FPS: 30 · Cameras: Gripper (Arducam) + Overhead (Intel RealSense) · Robot: SO-101 (Feetech STS3215)…
Published Apr 2025 · cc-by-4.0 · fleaven
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…
Published Sep 2025 · apache-2.0 · jesbu1
PEEK VLM-Labeled BRIDGE_v2 dataset This dataset contains the LeRobot-format BRIDGE-v2 dataset with paths and masks from the PEEK VLM drawn onto the image: PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of…
Published Apr 2025 · cc-by-4.0 · fleaven
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…
Published Feb 2026 · cc-by-4.0 · varunburde
Object Pose Estimation Using Implicit Representation for Transparent Objects This dataset aggregates high quality 3D mesh assets and rendered data for training and fine-tuning pose estimation models.
Published Apr 2026 · mit · leesangoh
PhysProbe Dynamics Probing Dataset Manipulation episodes from Isaac Lab collected for probing physics understanding in video world models (V-JEPA 2, VideoMAE, DINOv2).
Published Feb 2026 · cc-by-4.0 · jnogga
DROID Failure Episodes Failure episodes in DROID-COMMUNITY. Episodes with inconsistent data were discarded, leaving ~91% of all failure episodes.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…
Published Jul 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 3921, "total_frames": 569249, "total_tasks": 73, "total_videos": 7842, "total_chunks": 3,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 800, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · Traly
Note This is an unofficial implementation. This dataset was converted from AgiBotWorld-Alpha into the LeRobot format using https://github.com/Tavish9/any4lerobot/tree/main/agibot2lerobot Configurations Each sub-task is available as a…
Published Mar 2026 · cc-by-nc-4.0 · OpenMOSS-Team
RoboOmni: Proactive Robot Manipulation in Omni-modal Context 📖 arXiv Paper (Accepted to ICLR 2026 🎉) | 🌐 Website | 🤗 Model | 🤗 Dataset | 🛠️ Github | Recent advances in Multimodal Large Language Models (MLLMs) have driven rapid…
Published Mar 2026 · cc-by-4.0 · Whiteglove44
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 104, "total_videos": 39350,…
Published Mar 2026 · odc-by · Oatmealliu
UrbanVerse-100K Dataset UrbanVerse-100K is a large-scale, physics-aware 3D asset and material database curated for urban simulation, physical and embodied AI research.
Published Mar 2026 · cc-by-4.0 · bitsydarel
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · cc-by-4.0 · nawed
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · cc-by-nc-nd-4.0 · rethinklab
Bench2Drive-Speed Project Page | Paper | GitHub Bench2Drive-Speed is a closed-loop benchmark for desired-speed conditioned autonomous driving, enabling explicit control over vehicle behavior through target speed and overtake/follow…
Published Apr 2026 · other · bones-studio
BONES-SEED: Skeletal Everyday Embodiment Dataset BONES-SEED (Skeletal Everyday Embodiment Dataset) is an open dataset of 142,220 annotated human motion animations for humanoid robotics.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · cc-by-4.0 · AGuyWithAnAI
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · cc-by-4.0 · NeonoV1
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Mar 2026 · cc-by-4.0 · AGuyWithAnAI
Computer Use Large A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet.
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 2366, "total_frames": 6205242, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2025 · cc-by-4.0 · ember-lab-berkeley
Retargeted AMASS Dataset for G1 This repository contains 100% of the AMASS dataset retargeted to the G1 humanoid and formatted for use with IsaacLab's AMP motion loader. It should also be compatible with ProtoMotions.
Published Jan 2026 · apache-2.0 · ygtxr1997
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 599, "total_videos": 87212,…
Published Mar 2026 · mit · riverfog7
The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments, annotated with natural language.
Published Apr 2026 · cc-by-nc-sa-4.0 · Physis-AI
WM-Eval GT: EWMBench Ground Truth Ground truth data for EWMBench evaluation, extracted and restructured for direct use by the wm-evaluation-harness framework. Source Extracted from agibot-world/EWMBench (gt_dataset.tar).
Published Oct 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 85, "total_frames": 127500, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
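As a rough illustration, the fields visible in the meta/info.json excerpt above are enough to estimate episode length. The sketch below assumes only the three keys shown in that excerpt (total_episodes, total_frames, fps); the helper name episode_stats is hypothetical and is not part of LeRobot.

```python
def episode_stats(info: dict) -> dict:
    """Estimate mean episode length from a LeRobot-style meta/info.json dict.

    Hypothetical helper for illustration; it only reads the keys shown
    in the card excerpt and is not part of the LeRobot API.
    """
    episodes = info["total_episodes"]
    frames = info["total_frames"]
    fps = info["fps"]
    if episodes <= 0 or fps <= 0:
        raise ValueError("need positive total_episodes and fps")
    mean_frames = frames / episodes
    return {"mean_frames": mean_frames, "mean_seconds": mean_frames / fps}


# Values copied from the card excerpt above.
info = {"total_episodes": 85, "total_frames": 127500, "fps": 50}
stats = episode_stats(info)
print(stats)
```

For this record the estimate works out to 1,500 frames, or 30 seconds, per episode on average.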
Published Nov 2025 · apache-2.0 · HuggingFaceVLA
Community Dataset v2 A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 340 datasets from 117 contributors worldwide.
Published Jan 2026 · mit · ganatrask
DR_dataset Simulated robot manipulation dataset for imitation learning. Dataset Description This dataset contains 200 episodes of a Trossen WXAI robotic arm performing food transfer tasks in MuJoCo simulation.
Published Mar 2026 · cc-by-nc-sa-4.0 · ori-drs
We present the Oxford Spires Dataset, captured in and around well-known landmarks in Oxford using a custom-built multi-sensor perception unit as well as a millimetre-accurate map from a terrestrial LiDAR scanner (TLS).
Published Apr 2026 · apache-2.0 · DerekLX
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "chunks_size": 1000, "data_files_size_in_mb":…
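A record whose info.json reports zero episodes and zero frames, like the one above, is worth flagging before it is promoted into a deeper profile. A minimal triage sketch, assuming the same excerpted keys; the function name is_empty_record is hypothetical.

```python
def is_empty_record(info: dict) -> bool:
    """Return True when a LeRobot-style info dict reports no usable data.

    Hypothetical triage helper; missing keys are treated as zero.
    """
    return info.get("total_episodes", 0) == 0 or info.get("total_frames", 0) == 0


# Values copied from two card excerpts in this listing.
empty = {"robot_type": "so_follower", "total_episodes": 0, "total_frames": 0}
populated = {"robot_type": "panda", "total_episodes": 1600, "total_frames": 768897}
print(is_empty_record(empty), is_empty_record(populated))
```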
Published Jan 2026 · apache-2.0 · OpenDriveLab
Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Tian 📜…
Published Mar 2026 · apache-2.0 · ehalicki
LeWAM Community Dataset A large-scale community-contributed robotics dataset for robot learning research featuring 321 datasets from the LeRobot community. Derived from Community Dataset v2, and converted to LeRobotDataset v3.0.
Published May 2025 · apache-2.0 · cadene
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AgiBot_A2D", "total_episodes": 28122, "total_frames": 47613574, "total_tasks": 30, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · mkkimuser
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 82092, "total_tasks": 1, "total_videos": 1500,…
Published Aug 2025 · apache-2.0 · Voxel51
Dataset Card for aloha_pen_uncap This dataset is a FiftyOne conversion in LeRobot format of the aloha_pen_uncap_diverse subset of BiPlay.
Published Nov 2025 · apache-2.0 · frodobots
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "frodobot", "total_episodes": 185761, "total_frames": 163360777, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1600, "total_frames": 768897, "total_tasks": 116, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · H-EmbodVis
Towards Generalizable Robotic Manipulation in Dynamic Environments Heng Fang1, Shangru Li1, Shuhan Wang1, Xuanyang Xi2, Dingkang Liang1, Xiang Bai1 1 Huazhong University of Science and Technology, 2 Huawei Technologies Co.
Published Mar 2026 · cc-by-nc-sa-4.0 · HXX
MarketGen : A Scalable Simulation Platform with Auto-Generated Embodied Supermarket Environments 📑 arXiv | 🌐 Project | 🤗 Hugging Face MarketGen is a scalable simulation platform with automatic scene generation for embodied supermarket…
Published Feb 2026 · apache-2.0 · peak888
This dataset was created using LeRobot. This is release R2 (10K episodes) of yaak-ai/L2D in LeRobotDataset V3 format. The R3 release of 100K episodes is now available in LeRobotDataset V3 format.
Published Apr 2025 · not specified · McGill-NLP
AgentRewardBench 💾 Code 📄 Paper 🌐 Website 🤗 Dataset 💻 Demo 🏆 Leaderboard AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Xing Han Lù, Amirhossein Kazemnejad*, Nicholas Meade, Arkil Patel, Dongchan Shin,…
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "total_videos": 10200, "total_chunks": 6,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Apr 2026 · mit · VLABench
VLABench Primitive Tasks Dataset - LeRobot v3.0 Dataset Description This dataset is organized in the LeRobot v3.0 format and is used for the official integration of VLABench into the LeRobot framework.
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Jul 2025 · apache-2.0 · haosulab
ManiSkill Demonstrations This dataset repo contains all of the latest ManiSkill demonstration datasets as well as some pretrained model weights used to generate some demonstrations.
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Mar 2026 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 800, "total_frames": 20000, "total_tasks": 1, "total_videos": 0, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 32212, "total_tasks": 47, "total_videos": 300, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · ROBOTIS
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "total_episodes": 857, "total_frames": 85474, "total_videos": 2828, "codebase_version": "v2.1", "robot_type": "ffw_bg2_rev4", "total_tasks":…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19229 (MVP) Competition task: 抓水果 (Fruit Grasping) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Dec 2025 · cc-by-nc-4.0 · spatialverse
SAGE-3D Collision Mesh: Physics-Enabled Collision Bodies for 3D Gaussian Scenes Paper | Project Page | Code High-precision collision geometry dataset extracted from 1,000 indoor Mesh scenes, enabling physically accurate navigation and…
Published Apr 2026 · apache-2.0 · williamdgomez
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 91, "total_frames": 73578, "total_tasks": 1, "chunks_size": 1000,…
Published Nov 2025 · mit · shihao1895
Dataset Structure These datasets are used for MemoryVLA training. This is the standard LIBERO setting and can be directly used for other models as well. All data follow the RLDS format from the LIBERO benchmark, where each task initially…
Published Apr 2026 · apache-2.0 · TrossenRoboticsCommunity
This dataset was created using LeRobot. Dataset Description Converted from TrossenMCAP format using the Trossen SDK trossen_mcap_to_lerobot_v2 tool.
Published Apr 2026 · cc-by-sa-4.0 · yxgao
from datasets import Features, Value, Video, load_dataset features = Features({ "head_color.mp4": Video(), "proprio_stats.hdf5": Value("binary"), "task.json": { "episode_id": Value("int64"), "init_scene_text": Value("string"),…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "total_videos": 6484, "total_chunks": 4,…
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 5208, "total_frames": 1139911, "total_tasks": 7, "total_videos": 5208, "total_chunks":…
Published Apr 2026 · cc-by-nc-4.0 · open-world-agents
D2E-480p Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.
Published Nov 2025 · mit · shihao1895
Dataset Structure These datasets are used for MemoryVLA training. This is the standard setting and can be directly used for other models as well. All data follow the RLDS format from the Fractal dataset.
Published Apr 2026 · cc-by-nc-4.0 · open-world-agents
D2E-Original Project Page · Paper (arXiv) · GitHub · OWA Toolkit Documentation This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.
Published Sep 2025 · mit · michaelmunje
SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation This is the accompanying dataset for the Social Navigation Scene Understanding Benchmark (SocialNav-SUB), which is a Visual Question Answering (VQA) dataset…
Published Apr 2025 · cc-by-4.0 · fleaven
Retargeted AMASS for Robotics Project Overview This project aims to retarget motion data from the AMASS dataset to various robot models and open-source the retargeted data to facilitate research and applications in robotics and human-robot…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1647, "total_frames": 42328, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5", "total_episodes": 896, "total_frames": 87783, "total_tasks": 5, "total_videos": 1792, "total_chunks": 1,…
Published Oct 2025 · apache-2.0 · zahidpichen
This is the ScanNet dataset, which contains indoor scenes and is used for 3D object detection, 3D segmentation, etc. Acknowledgement @inproceedings{dai2017scannet, title={ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes},…
Published Jul 2025 · cc-by-sa-4.0 · suhaisheng0527
Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments Description RoboSense is a large-scale multimodal dataset constructed to facilitate egocentric robot perception…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 638, "total_frames": 421684, "total_tasks": 1, "chunks_size": 1000,…
Published Aug 2025 · bsd-2-clause · AmeliaCMU
Dataset Overview The Amelia42-Mini dataset provides air traffic position reports for 42 major U.S. airports, including the following airports: KATL (Hartsfield-Jackson Atlanta International Airport) KBDL (Bradley International Airport)…
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 703, "total_frames": 456261, "total_tasks": 1, "chunks_size": 1000,…
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1, "total_frames": 841, "total_tasks": 1, "chunks_size": 1000,…
Published Nov 2025 · apache-2.0 · dunnolab
This dataset was created using LeRobot. Dataset Description The English version of this dataset integrates 598 open-source community datasets into a single unified corpus, comprising 22,709 episodes and approximately 9.4 million frames…
Published Mar 2026 · mit · failo0711
ATG-MoE Pressure-Reducing Valve Assembly Training Set This repository provides the training set used in our paper: "ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning".
Published Dec 2025 · cc-by-nc-4.0 · maum-ai
COMMAND Dataset 😃 🤗 Dataset Availability The evaluation data from this dataset is now available on Hugging Face: Repository: maum-ai/COMMAND Content: 60 evaluation scenarios (240 files, ~38GB) File Types: ROS bagfiles (.bag), metadata…
Published Apr 2026 · mit · HAI-Lab
LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models LIBERO-Para is a controlled benchmark for evaluating the paraphrase robustness of Vision-Language-Action (VLA) models.
Published Aug 2025 · cc-by-4.0 · allenai
MolmoAct - Midtraining Mixture Data Mixture used for MolmoAct Midtraining. Contains MolmoAct Dataset formulated as Action Reasoning Data.
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Apr 2026 · apache-2.0 · rainbowrobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_size": 1000, "data_files_size_in_mb":…
Published Sep 2025 · not specified · InternRobotics
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning 🔑Key Features MesaTask-10K, a large-scale dataset for task-oriented tabletop scene generation, comprises approximately 10,700 synthetic tabletop scenes…
Published Mar 2026 · apache-2.0 · XDUImageLab
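SandGO Dataset (v1.0) SandGO is a long-horizon multimodal dataset designed for embodied AI and robot learning. It was produced by refining, deduplicating, and structuring the original MarsMind_data, and is intended to support tasks such as long-horizon decision-making, instruction following, and multimodal imitation learning. The dataset contains complete state-action sequences, multi-view RGB images, LiDAR point clouds, depth point clouds, and the corresponding natural-language instructions, for training Vision-Language-Action (VLA)…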
Published Apr 2026 · cc-by-4.0 · Chuhaojin
SuSuInterActs Dataset A Large-Scale Multimodal Dialogue Corpus with Synchronized Speech, Full-Body Motion, and Facial Expressions From the paper: SentiAvatar: Towards Expressive and Interactive Digital Humans Overview SuSuInterActs is a…
Published May 2026 · apache-2.0 · RukawaY
A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Ziyuan Xia • Jingyi Xu • Chong Cui • Yuanhong Yu• Jiazhao Zhang • Qingsong Yan • Tao Ni Junbo Chen • Xiaowei Zhou • Hujun Bao • Ruizhen Hu • Sida Peng 🤗 About This…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19382 (别碰我爪子) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Oct 2025 · cc-by-nc-nd-4.0 · chenhn02
MetaFold Dataset is a point-cloud trajectory dataset designed for multi-category garment folding tasks in robotic manipulation.
Published Sep 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100_follower", "total_episodes": 50, "total_frames": 11939, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1183, "total_frames": 3102621, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "sawyer", "total_episodes": 1796, "total_frames": 168423, "total_tasks": 3, "total_videos": 1796, "total_chunks": 2,…
Published Oct 2025 · apache-2.0 · yaak-ai
This dataset was created using LeRobot. This is release R2 (10K episodes) of yaak-ai/L2D in LeRobotDataset V3 format. The R3 release of 100K episodes is now available in LeRobotDataset V3 format.
Published Mar 2026 · apache-2.0 · Gongsta
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 141, "total_frames": 306425, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · ehalicki
LeWAM Community Dataset (Preprocessed) A large-scale community-contributed robotics dataset for robot learning research featuring 318 datasets from the LeRobot community.
Published Apr 2026 · apache-2.0 · kimz1121
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 2, "total_frames": 309, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19380 (MAEN) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Feb 2026 · apache-2.0 · VLA-Arena
VLA-Arena Dataset (L0 - Large Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.
Published Sep 2025 · cc-by-4.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 97939, "total_tasks": 5, "total_videos": 3000, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19361 (DynamicX) Competition task: 环套在柱子上 (Put Ring onto Rod) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Jan 2025 · mit · oier-mees
BiPlay contains 9.7 hours of bimanual data collected with an aloha robot at the RAIL lab @ UC Berkeley, USA. It contains 7023 clips, 2000 language annotations and 326 unique scenes.
Published Sep 2025 · cc-by-4.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1085, "total_frames": 77965, "total_tasks": 89, "total_videos": 2170, "total_chunks": 2,…
Published Apr 2025 · apache-2.0 · Felix-Zhenghao
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101499, "total_tasks": 10, "total_videos": 0, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · OpenDriveLab-org
Haochen Tian, Tianyu Li, Haochen Liu, Jiazhi Yang, Yihang Qiu, Guang Li, Junli Wang, Yinfeng Gao, Zhang Zhang, Liang Wang, Hangjun Ye, Tieniu Tan, Long Chen, Hongyang Li 📧 Primary Contact: Haochen Tian 📜…
Published Mar 2026 · cc-by-4.0 · Richard-Nai
Raw Data for HuMI [Project Page] | [Paper] Dataset Summary This dataset contains raw data collected using the HuMI data collection pipeline. It includes MP4 videos and tracker trajectories.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "total_videos": 3000, "total_chunks": 2,…
Published Mar 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1319, "total_frames": 3414338, "total_tasks": 1, "chunks_size": 1000,…
Published May 2025 · cc-by-4.0 · nvidia
PhysicalAI Robotics Manipulation in the Kitchen Dataset Description: PhysicalAI-Robotics-Manipulation-Kitchen is a dataset of automatically generated motions of robots performing operations such as opening and closing cabinets, drawers,…
Published Jan 2026 · apache-2.0 · ygtxr1997
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974, "total_videos": 212768, "total_chunks":…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1000, "total_frames": 100000, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…
Published Oct 2025 · apache-2.0 · daixianjie
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "panda", "total_episodes": 3958, "total_frames": 574553, "total_tasks": 89, "total_videos": 0, "total_chunks": 4,…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19234 (Apex) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…
Published Apr 2026 · apache-2.0 · mkkimuser
This dataset was created using Physical AI Tools and LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "FFW_SG2", "total_episodes": 500, "total_frames": 79017, "total_tasks": 1, "total_videos": 1500,…
Published Apr 2026 · cc-by-4.0 · sunli1201
This dataset was created using LeRobot. Dataset Description This dataset combines four individual Libero datasets: Libero-Spatial, Libero-Object, Libero-Goal and Libero-10.
Published Mar 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1197, "total_frames": 3146294, "total_tasks": 1, "chunks_size": 1000,…
Published Feb 2026 · not specified · InternRobotics
RobotInter-VQA: Intermediate Representation Understanding & Generation VQA Dataset for Manipulation A Visual Question Answering dataset for robotic manipulation, developed as part of the RoboInter, covering generation,…
Published Oct 2025 · mit · Sylvest
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models 📄 Paper | 🏗️ Repo | 🌐 Website | 🤗 Assets | 🤗 Model | 📁 Training Dataset 🔥 Overview This repository contains the official implementation and benchmark for our…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_videos": 908, "total_chunks": 1,…
Published Sep 2025 · cc-by-4.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3603, "total_frames": 237798, "total_tasks": 406, "total_videos": 7206, "total_chunks": 4,…
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Nov 2025 · apache-2.0 · davidlinjiahao
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 5, "total_frames": 1153, "total_tasks": 1, "total_videos": 5, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · Joocjun
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.
Published Mar 2026 · apache-2.0 · Gongsta
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 132, "total_frames": 273043, "total_tasks": 1, "chunks_size": 1000,…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "jaco_2", "total_episodes": 976, "total_frames": 70127, "total_tasks": 88, "total_videos": 1952, "total_chunks": 1,…
Published Dec 2025 · apache-2.0 · ducido
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 5124, "total_frames": 308918, "total_tasks": 389, "total_videos": 0, "total_chunks": 6,…
Published Sep 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 50, "total_frames": 19631, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_videos": 1118, "total_chunks": 1,…
Published Dec 2025 · apache-2.0 · izuluaga
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 80, "total_frames": 70277, "total_tasks": 2, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…
Published Jan 2026 · mit · Everloom
Active Perception AAWR Dataset in GRASP Lab Mock Kitchen Here is our full dataset (13.5GB) in DROID format: https://huggingface.co/datasets/Everloom/AAWR-DROID/ It includes 4 scenes as below; each is cleaned…
Published Dec 2025 · cc-by-4.0 · spatialverse
SAGE-3D VLN Data: Vision-Language Navigation Dataset with Hierarchical Instructions A comprehensive VLN dataset featuring 2 million trajectory-instruction pairs across 1,000 indoor scenes, with hierarchical…
Published Apr 2026 · apache-2.0 · Purple69
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 1345076, "total_tasks": 72, "chunks_size": 1000,…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Apr 2026 · cc-by-sa-4.0 · Stardust-hyx
BAPData (EFM-10 Benchmark) BAPData is a dataset introduced in the paper "Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy".
Published Jan 2025 · mit · oier-mees
The FuSe dataset contains 26,866 trajectories collected on a WidowX robot at the RAIL lab @ UC Berkeley, USA. It contains visual, tactile, sound and action data collected across several environments, annotated with natural language.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 902, "total_frames": 294139, "total_tasks": 1, "total_videos": 902, "total_chunks": 1,…
Published Jan 2025 · apache-2.0 · danaaubakirova
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "koch", "total_episodes": 51, "total_frames": 16602, "total_tasks": 1, "total_videos": 102, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · FedorX8
This dataset was created using LeRobot. Dataset Description Converted BridgeV2 dataset to lerobot format. Downloaded from [https://rail.eecs.berkeley.edu/datasets/bridge_release/data/] (only demos_8_17.zip, not scripted_6_18.zip) Dataset…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 454, "total_frames": 66984, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…
Published Nov 2025 · apache-2.0 · dunnolab
This dataset was created using the LeRobot library. Dataset Description The Russian-language version of this dataset combines 598 open community datasets into a single unified corpus comprising 22,709 episodes and approximately 9.4 million frames…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 3000, "total_frames": 149985, "total_tasks": 1, "total_videos": 3000, "total_chunks": 3,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 18, "total_frames": 67500, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
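For comparing records like the one above, the visible meta/info.json fields are enough to derive per-episode length and total recording time. The following is a minimal sketch, not part of any dataset card: the `summarize` helper is a hypothetical name, and the JSON string contains only the fields visible in the truncated excerpt (the elided "splits" entry is left out, not guessed).

```python
import json

# Fields copied from the truncated meta/info.json excerpt above.
# The elided "splits" entry is omitted rather than reconstructed.
info_text = """
{
  "codebase_version": "v3.0",
  "robot_type": "aloha",
  "total_episodes": 18,
  "total_frames": 67500,
  "total_tasks": 1,
  "chunks_size": 1000,
  "fps": 50
}
"""


def summarize(info: dict) -> dict:
    """Derive rough size statistics from a LeRobot-style info dict (hypothetical helper)."""
    episodes = info["total_episodes"]
    frames = info["total_frames"]
    fps = info.get("fps")  # not every excerpt in this index shows fps

    # Guard against empty datasets to avoid dividing by zero.
    frames_per_episode = frames / episodes if episodes else 0.0
    summary = {"frames_per_episode": frames_per_episode}

    # Durations are only computable when fps is present and non-zero.
    if fps:
        summary["seconds_per_episode"] = frames_per_episode / fps
        summary["total_minutes"] = frames / fps / 60
    return summary


summary = summarize(json.loads(info_text))
print(summary)
```

Under these assumptions, the record averages 3,750 frames (75 seconds) per episode, or about 22.5 minutes of recording in total. Records whose excerpts do not show an fps value only yield the frames-per-episode figure.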
Published Apr 2026 · apache-2.0 · kaiseong
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "rby1", "total_episodes": 1105, "total_frames": 235211, "total_tasks": 3, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 631, "total_frames": 146241, "total_tasks": 7, "total_videos": 1262, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · brandonyang
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 4444, "total_frames": 930500, "total_tasks": 1424, "total_videos": 0, "total_chunks": 5,…
Published Apr 2026 · apache-2.0 · alesspalma
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1706, "total_frames": 71623, "total_tasks": 201, "total_videos": 0, "total_chunks": 2,…
Published Apr 2026 · other · amathislab
MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoFullBody musculoskeletal model, generated using General Motion Retargeting (GMR).
Published Jan 2026 · apache-2.0 · JingkunAn
TraceSpatial-Bench: An Object-Centric 3D Trajectory Planning Benchmark Welcome to TraceSpatial-Bench, an object-centric 3D spatial trace planning benchmark provided by RoboTracer. TraceSpatial-Bench is the first benchmark that evaluates…
Published Feb 2026 · mit · ThiesOelerich
SafeFlowMPC Finetuning Dataset This repository contains the finetuning dataset used in the paper SafeFlowMPC: Predictive and Safe Trajectory Planning for Robot Manipulators with Learning-based Policies.
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Apr 2026 · cc-by-nc-sa-4.0 · OpenDriveLab-org
TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks 🎯 Overview TAMEn builds upon the UMI paradigm with key enhancements in multimodality, precision-portability synergy, replayability, and data…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 2460, "total_frames": 112980, "total_tasks": 9, "total_videos": 9840, "total_chunks": 3,…
Published Apr 2026 · apache-2.0 · Joocjun
GR1 Tabletop Merged LeRobot Datasets Merged and subsampled versions of the GR1 tabletop manipulation datasets from the NVIDIA PhysicalAI-Robotics-GR00T-X-Embodiment-Sim collection, formatted in LeRobot v2.0 format.
Published Feb 2026 · mit · microsoft
VITRA Teleoperation Dataset Dataset Summary This dataset contains real-world robot teleoperation demonstrations collected using a 7-DoF robotic arm equipped with a dexterous hand and a head-mounted RGB camera.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 456, "total_frames": 44875, "total_tasks": 1, "total_videos": 912, "total_chunks": 1,…
Published Oct 2025 · mit · omniretarget
OmniRetarget Dataset: Humanoid Loco-Manipulation & Scene Interaction This dataset contains motion trajectories of a G1 humanoid robot interacting with objects and complex terrains.
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Jan 2026 · cc-by-nc-4.0 · nvidia
NitroGen Dataset Dataset Description: The NitroGen dataset contains action annotations for publicly available gameplay videos.
Published Sep 2025 · not specified · ShuaiYang03
This repository contains the VLA-IT dataset, a curated 650K-sample Vision-Language-Action Instruction Tuning dataset, and the SimplerEnv-Instruct benchmark.
Published Feb 2026 · cc-by-4.0 · jnogga
BridgeData V2 Teleoperated Demos Teleoperated demonstrations in BridgeData V2. Episodes with inconsistent data or lacking language instructions were discarded, leaving ~86% of all teleoperated episodes.
Published Apr 2026 · not specified · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 151, "total_frames": 89646, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · locht131
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 415, "total_frames": 50744, "total_tasks": 10, "total_videos": 830, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 379, "total_frames": 101469, "total_tasks": 10, "chunks_size": 1000, "data_files_size_in_mb":…
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 537, "total_frames": 1835987, "total_tasks": 2, "chunks_size": 1000,…
Published Feb 2025 · apache-2.0 · physical-intelligence
This dataset was created using LeRobot. Dataset Description This dataset is a lerobot conversion of the aloha_pen_uncap_diverse subset of BiPlay.
Published Jun 2025 · cc-by-4.0 · nvidia
GraspGen: Scaling Sim2Real Grasping GraspGen is a large-scale simulated grasp dataset for multiple robot embodiments and grippers.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1482, "total_frames": 38240, "total_tasks": 1, "total_videos": 5928, "total_chunks": 2,…
Published Mar 2026 · apache-2.0 · SUZ-tsinghua
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 297520, "total_tasks": 500, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_videos": 1118, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 11834, "total_frames": 610907, "total_tasks": 1, "chunks_size": 1000, "fps": 3, "splits": {…
Published Jan 2026 · cc-by-4.0 · BeingBeyond
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_videos": 1140, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · yinchenghust
This dataset was created using LeRobot. It contains embodied Chain-of-Thought (CoT) demonstrations for the LIBERO benchmark, featuring paired reasoning and action traces.
Published Mar 2026 · cc-by-nc-sa-4.0 · chris1004336379
360DVO Dataset This is the official dataset from the paper 360DVO: Deep Visual Odometry for Monocular 360-Degree Camera, published in IEEE Robotics and Automation Letters (RA-L), 2026.
Published Mar 2026 · apache-2.0 · SUZ-tsinghua
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 2500, "total_frames": 549787, "total_tasks": 2409, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total_videos": 576, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "robocasa", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 50, "chunks_size": 1000,…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total_videos": 576, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · mfisch3r
Pick & Place Cube Dataset Collected for Agibot G2 robot
Published Mar 2026 · cc-by-nc-sa-4.0 · InternRobotics
Viewer is explicitly configured to read the parquet split only. Citation If you find this dataset useful, please cite: @article{zhao2026SythnVerse, title={SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking},…
Published Apr 2026 · apache-2.0 · DORLR
This dataset was created with the IsaacLab-WireManagement scripted agent. Total time: 45 minutes 56.85 seconds to generate 506 episodes.
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 2955, "total_frames": 241059, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_videos": 1140, "total_chunks": 1,…
Published Nov 2024 · mit · Fanqi-Lin
Raw GoPro Videos for Four Robotic Manipulation Tasks This repository contains raw GoPro videos of robotic manipulation tasks collected in-the-wild using UMI, as described in the…
Published Apr 2026 · apache-2.0 · vedpatwardhan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 150, "total_frames": 7800, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb": 100,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 484, "total_frames": 20405, "total_tasks": 1, "total_videos": 484, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · leggedrobotics
WildOS Frontiers Dataset Dataset Description This dataset provides visual frontier annotations for outdoor long-range navigation, created for WildOS: Open-Vocabulary Object Search in the Wild.
Published Apr 2026 · apache-2.0 · av120
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 499, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Feb 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_videos": 908, "total_chunks": 1,…
Published Aug 2025 · cc-by-sa-4.0 · GraspClutter6D
GraspClutter6D Dataset GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Raeyoung Kang, Sangbeom Lee, Sangjun Noh,…
Published Apr 2026 · apache-2.0 · SeonghuJeon
DROID 1.0.1 — Preprocessed Mirror A ready-to-use mirror of lerobot/droid_1.0.1 with two preprocessing fixes baked in: Fixed meta/episodes/*.parquet timestamps.
Published Mar 2025 · apache-2.0 · haosulab
ManiSkill2 Data Update: ManiSkill 3 has been released at https://github.com/haosulab/ManiSkill/. It uses different datasets than ManiSkill2, so the data here is not expected to transfer over. ManiSkill2 is a unified benchmark for learning…
Published Mar 2026 · mit · Lusmse
Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to-synthetic data ratios.
Published Sep 2025 · apache-2.0 · MarkusWuenstel
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 2, "total_frames": 3394, "total_tasks": 1, "total_videos": 6, "total_chunks": 1, "chunks_size":…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 139, "total_frames": 8277, "total_tasks": 3, "chunks_size": 1000, "fps": 30, "splits": {…
Published Jun 2025 · mit · arth-shukla
ManiSkill-HAB TidyHouse Dataset Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB TidyHouse.
Published Apr 2026 · apache-2.0 · andreaskoepf
DK-1 Merged Dataset This dataset was created using LeRobot. Dataset Description Merged and deduplicated DK-1 bimanual robot dataset. All source videos are re-encoded to 640x360 h264 with 14D joint-space actions.
Published Feb 2026 · apache-2.0 · robotics-diffusion-transformer
Dataset Summary This dataset provides shards in the WebDataset format for fine-tuning RDT-2 or other policy models on bimanual manipulation.
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1804, "total_frames": 338188, "total_tasks": 24, "chunks_size": 1000, "fps": 10, "splits": {…
Published Jun 2025 · apache-2.0 · youliangtan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 1, "total_frames": 1358, "total_tasks": 1, "total_videos": 2, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 60000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Apr 2026 · apache-2.0 · jdoo2
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Panda", "total_episodes": 108, "total_frames": 44097, "total_tasks": 1, "total_videos": 324, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · DRMNmadhan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120066, "total_tasks": 1, "total_videos": 300, "total_chunks": 1,…
Published Mar 2026 · cdla-permissive-2.0 · BAAI-Humanoid
MOSAIC Dataset This repository releases the built-in MOSAIC multi-source motion dataset in the following paper: MOSAIC: Bridging the Sim-to-Real Gap in Generalist Humanoid Motion Tracking and…
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 290, "total_frames": 814263, "total_tasks": 7, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · DAVIAN-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 356, "total_frames": 103656, "total_tasks": 16, "chunks_size": 1000, "data_files_size_in_mb":…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_videos": 480, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · Traly
RoboTwin2.0 - LeRobot v3.0 RoboTwin2.0 converted to LeRobot v3.0 format. Contains both joint-space and end-effector (EE) pose data for bimanual manipulation across 5 robot embodiments.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_videos": 480, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 235, "total_frames": 224240, "total_tasks": 1, "chunks_size": 1000,…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Gripper", "total_episodes": 201, "total_frames": 172649, "total_tasks": 1, "total_videos": 804,…
Published Apr 2026 · apache-2.0 · ClOBOT
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 100, "total_frames": 147066, "total_tasks": 1, "total_videos": 300,…
Published Dec 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Description Repository: X-VLA License: Apache 2.0 Paper: Zheng et al., 2025, “X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model” (arXiv:2510.10274)…
Published Jul 2025 · mit · iis-esslingen
The ROVER Visual SLAM Benchmark News [2024/12/05] Initial code release. [2025/05/20] ROVER is accepted to IEEE Transactions on Robotics.
Published Feb 2026 · apache-2.0 · Juelg
Simulated Franka Pick-Cube Tactile Dataset The dataset was generated using the Robot Control Stack (RCS). RCS is a flexible Gymnasium wrapper-based robot control interface made for robot learning and specifically Vision-Language-Action…
Published Aug 2025 · mit · behavior-robot-suite
Dataset Card for BEHAVIOR Robot Suite (BRS) Data This dataset provides robotic trajectories for five real-world household tasks.
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 135, "total_frames": 68913, "total_tasks": 3, "total_videos": 270, "total_chunks": 1,…
Published Apr 2026 · mit · Embodied-CoT
Dataset for Embodied Chain-of-Thought Reasoning for LIBERO-90, as used by ECoT-Lite. TFDS Demonstration Data The TFDS dataset contains successful demonstration trajectories for LIBERO-90 (50 trajectories for each of 90 tasks).
Published Apr 2026 · apache-2.0 · Celina717
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "uav", "total_episodes": 799, "total_frames": 79900, "total_tasks": 2, "total_videos": 0, "total_chunks": 1,…
Published Apr 2026 · cc-by-4.0 · nvidia
Dataset Description The Physical AI NuRec dataset seeks to empower robotic researchers to build the next generation of physical AI based end-to-end robotic models.
Published Apr 2026 · apache-2.0 · HollyTan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2025 · mit · colosseum
Colosseum Dataset Card This dataset contains demonstrations for training and testing Imitation Learning based policies, taken from our simulation benchmark Colosseum, which is based on RLBench.
Published Mar 2026 · apache-2.0 · villekuosmanen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 200, "total_frames": 47865, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].
Published Apr 2026 · apache-2.0 · azaracla
Community Dataset v1 (v3.0) A large-scale community-contributed robotics dataset for vision-language-action learning, featuring 119 datasets from 52 contributors worldwide.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1355, "total_frames": 67750, "total_tasks": 3, "total_videos": 1355, "total_chunks": 2,…
Published Mar 2026 · apache-2.0 · Fiberal
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 50, "total_frames": 52868, "total_tasks": 1, "chunks_size": 1000,…
Published Oct 2025 · mit · zjunlp
🌊 OceanGym 🦾 A Benchmark Environment for Underwater Embodied Agents OceanGym is a high-fidelity embodied underwater environment that simulates a realistic ocean…
Published Oct 2025 · mit · k1000dai
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_videos": 135, "total_chunks": 1,…
Published Apr 2025 · apache-2.0 · aopolin-lv
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 432, "total_frames": 52970, "total_tasks": 10, "total_videos": 864, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Gongsta
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 532, "total_frames": 1106053, "total_tasks": 1, "chunks_size": 1000,…
Published Jan 2026 · apache-2.0 · Fizzi789
Humanoid Everyday A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Overview Humanoid Everyday is a large-scale, diverse humanoid manipulation dataset designed for open-world robotic learning and embodied intelligence.
Published Mar 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Mar 2026 · apache-2.0 · tomo202
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 108, "total_frames": 16645, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19359 (炬亮启航) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 428, "total_frames": 52042, "total_tasks": 10, "chunks_size": 1000, "fps": 10, "splits": {…
Published Mar 2026 · mit · POSE-Lab
IndustryShapes is a new benchmark dataset tailored for 6D object pose estimation in industrial settings.
Published Jan 2026 · apache-2.0 · gpudad
SO101 Pick Cube Dataset (Chunked) This is a restructured version of the gpudad/so101_pick_cube dataset with episode-level video files for faster data loading during training.
Published Apr 2026 · apache-2.0 · BrunoM42
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 25307, "total_frames": 14957899, "total_tasks": 1221, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 150, "total_frames": 3970, "total_tasks": 8, "total_videos": 150, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "total_videos": 3000, "total_chunks": 2,…
Published May 2025 · not specified · nvidia
PhysicalAI-Robotics-Manipulation-Objects is a dataset of automatically generated motions of robots performing operations such as picking and placing objects in a kitchen environment.
Published Nov 2025 · apache-2.0 · fracapuano
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "R1Pro", "total_episodes": 2000, "total_frames": 21227314, "total_tasks": 10, "chunks_size": 1000,…
Published Dec 2025 · apache-2.0 · aractingi
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95658, "total_frames": 27630375, "total_tasks": 49630, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · aivanni
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 37, "total_frames": 36234, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19353 (模型学的全队) Competition task: 单词拼写 (Word Spelling) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published May 2025 · not specified · Richard-Nai
Datasets for OneTwoVLA This repository provides datasets collected with the UMI, converted into the LeRobot data format, along with synthetic vision-language data used in the paper OneTwoVLA: A Unified…
Published Jan 2024 · cc-by-4.0 · charlesxu0124
Functional Manipulation Benchmark This robot learning dataset is a part of the paper "FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning".
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 29400, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Jan 2026 · cc-by-4.0 · BeingBeyond
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total_videos": 830, "total_chunks": 1,…
Published Sep 2025 · mit · wjh-svm
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark Griffin is the pioneering publicly available dataset for aerial-ground cooperative 3D perception.
Published Nov 2025 · cc-by-4.0 · leggedrobotics
NaviTrace is a novel VQA benchmark for VLMs that evaluates models on their embodiment-specific understanding of navigation across challenging real-world scenarios.
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1447, "total_frames": 699432, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…
Published Apr 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Dual", "total_episodes": 254, "total_frames": 178104, "total_tasks": 1, "total_videos": 762, "total_chunks":…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 39693, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…
Published Jan 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 32708, "total_tasks": 3, "total_videos": 240, "total_chunks": 1,…
Published Oct 2025 · apache-2.0 · imstevenpmwork
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_follower", "total_episodes": 51, "total_frames": 16267, "total_tasks": 1, "chunks_size": 1000, "fps": 30,…
Published Oct 2025 · mit · Sylvest
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models This repository contains the official implementation and benchmark for our paper "In-depth Robustness Analysis for…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 36013, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…
Published Jan 2026 · apache-2.0 · ygtxr1997
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "total_videos": 6484, "total_chunks": 4,…
Published Apr 2026 · apache-2.0 · andreaskoepf
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 101 Total…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 365, "total_frames": 34448, "total_tasks": 1, "total_videos": 730, "total_chunks": 1,…
Published Dec 2025 · cc-by-4.0 · nvidia
Dataset Description: The Arena-G1-Loco-Manipulation-Task dataset is a multimodal collection of trajectories generated in Isaac Lab. It supports a humanoid (G1) loco-manipulation task in the IsaacLab-Arena environment.
Published Mar 2026 · apache-2.0 · Gongsta
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 61, "total_frames": 120265, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · not specified · BeingBeyond
This data is a subset of the pretraining data for Being-H0.5. Citation Being-H0.5 @article{beingbeyond2026beingh05, title={Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization}, author={Luo, Hao and Wang, Ye…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 101, "total_frames": 45500, "total_tasks": 1, "total_videos": 404, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 1995, "total_frames": 187507, "total_tasks": 3, "chunks_size": 1000, "fps": 10, "splits": {…
Published Apr 2026 · apache-2.0 · aivanni
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 53, "total_frames": 47059, "total_tasks": 1, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 199, "total_frames": 1990, "total_tasks": 3, "total_videos": 398, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 554, "total_frames": 428498, "total_tasks": 11, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · erl-hub
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "reachy2", "total_episodes": 25, "total_frames": 6652, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Feb 2025 · mit · rjgpinel
Dataset Card for GEMBench dataset 💎 GEneralizable vision-language robotic Manipulation Benchmark Dataset A benchmark to systematically evaluate generalization capabilities of vision-and-language robotic manipulation policies.
Published Apr 2026 · other · amathislab
MuscleMimic GMR Retargeted Motions Pre-retargeted motion capture data for the MyoBimanualArm musculoskeletal model, generated using General Motion Retargeting (GMR).
Published Apr 2026 · apache-2.0 · locht131
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 413, "total_frames": 49801, "total_tasks": 10, "total_videos": 826, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1003, "total_frames": 325699, "total_tasks": 1, "total_videos": 1003, "total_chunks": 2,…
Published Dec 2025 · mit · TESS-Computer
Minecraft VLA Stage 1: Action Pretraining Data Vision-Language-Action training data for Minecraft, processed from OpenAI's VPT contractor dataset.
Published Oct 2025 · mit · attilczuk
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…
Published Jan 2026 · apache-2.0 · dtttttiiiii
ArtVIP dataset card Key Features ✅ 206 high-quality digital-twin articulated objects. ✅ 6 pre-configured digital-twin scenes, 6 user-defined scenes ✅ Reusable Modular interaction ✅ Physics fidelity ✅ Pixel-level affordance annotations…
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 631, "total_frames": 1055899, "total_tasks": 2, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 20000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Dec 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 25000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 7331, "total_frames": 156012, "total_tasks": 1, "chunks_size": 1000, "fps": 5, "splits": {…
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 100, "total_frames": 50000, "total_tasks": 1, "total_videos": 400, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · giacomoran
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so101_leader", "total_episodes": 200, "total_frames": 47567, "total_tasks": 1, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 170, "total_frames": 7148, "total_tasks": 17, "chunks_size": 1000, "fps": 5, "splits": {…
Published Apr 2025 · mit · WendiChen
Dataset of Reactive Diffusion Policy This is the raw and postprocessed dataset used in the paper Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for…
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_train_0_5000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,417 Splits: train: 0:5000 Data Layout data_path :…
Published Jan 2026 · apache-2.0 · KeWangRobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "libero_panda", "total_episodes": 500, "total_frames": 138090, "total_tasks": 10, "chunks_size": 1000,…
Published Jan 2026 · cc-by-4.0 · DAGroup-PKU
Rethinking Video Generation Model for the Embodied World 🔍 Benchmark Overview The benchmark is constructed from two complementary perspectives: task categories and robot embodiment types, covering a total of 650 image-text evaluation…
Published Feb 2026 · mit · keivalya
Nemotron-VLA MetaWorld Expert Demonstrations Expert demonstration dataset for training Vision-Language-Action (VLA) models on MetaWorld robot manipulation tasks. Built for the Nemotron-VLA project.
Published Apr 2026 · apache-2.0 · fecasado
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "blueberry_ros", "total_episodes": 80, "total_frames": 29986, "total_tasks": 1, "chunks_size": 1000,…
Published Jan 2025 · not specified · USC-PSI-Lab
Humanoid-X This repo contains the official dataset for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control". If you like our project, please give us a…
Published Jan 2026 · mit · Vigar001
UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories Yanghong Mei*1,5, Yirong Yang*2, Longteng Guo†1, Qunbo Wang3, Ming-Ming Yu2, Xingjian He1, Wenjun Wu2,4, Jing Liu1,5, 1Institute of Automation, Chinese…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_videos": 480, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 136, "total_frames": 27808, "total_tasks": 1, "total_videos": 272, "total_chunks": 1,…
Published Sep 2025 · cc-by-4.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 201, "total_frames": 32429, "total_tasks": 193, "total_videos": 201, "total_chunks": 1,…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 435, "total_frames": 18196, "total_tasks": 1, "total_videos": 435, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · macrodata
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2026 · apache-2.0 · macrodata
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 5, "total_frames": 3000, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Apr 2026 · apache-2.0 · HecklesL
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…
Published Apr 2026 · apache-2.0 · alex-cta
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "cta_ur_follower", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0,…
Published Apr 2026 · apache-2.0 · Purple69
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "AR4_MK3", "total_episodes": 2916, "total_frames": 337627, "total_tasks": 72, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · andreaskoepf
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 106 Total…
Published Jul 2025 · mit · autobio-bench
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 100, "total_frames": 55127, "total_tasks": 10, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 35000, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Aug 2024 · not specified · cadene
This dataset was created using 🤗 LeRobot.
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19159 (Robot大王) Competition task: 水果分类 (Fruit Sorting) Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 55, "total_frames": 110000, "total_tasks": 1, "total_videos": 165, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 102, "total_frames": 7490, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": {…
Published Feb 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 258, "total_frames": 595888, "total_tasks": 1, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 56, "total_frames": 16800, "total_tasks": 1, "total_videos": 224, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 25, "total_frames": 8750, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 201, "total_frames": 234959, "total_tasks": 1, "total_videos": 804,…
Published Mar 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "unknown", "total_episodes": 49, "total_frames": 34300, "total_tasks": 1, "chunks_size": 1000, "fps": 50, "splits": {…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Apr 2026 · apache-2.0 · pvrohin
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ur5e_aic", "total_episodes": 50, "total_frames": 91714, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 55000, "total_tasks": 1, "total_videos": 150, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · Jiaxin1234
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…
Published Apr 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Apr 2026 · apache-2.0 · Zekai-Chen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 601, "total_frames": 637163, "total_tasks": 1, "chunks_size": 1000,…
Published Jun 2025 · apache-2.0 · fbeltrao
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 10, "total_frames": 4539, "total_tasks": 1, "total_videos": 20, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · lucanunz
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 980, "total_frames": 106944, "total_tasks": 10, "total_videos": 3920, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "fanuc_mate", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "total_videos": 830, "total_chunks": 1,…
Published Dec 2025 · not specified · gate-institute
GATE-VLAP Datasets Grounded Action Trajectory Embeddings with Vision-Language Action Planning This repository contains preprocessed datasets from the LIBERO benchmark suite in WebDataset TAR format, specifically designed for training…
Published Apr 2025 · apache-2.0 · HuaihaiLyu
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 598, "total_frames": 285571, "total_tasks": 9, "total_videos": 1794, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · DRMNmadhan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 120081, "total_tasks": 1, "total_videos": 300, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · taetae77
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 304, "total_frames": 302309, "total_tasks": 1, "chunks_size": 1000,…
Published Jul 2025 · apache-2.0 · LightwheelAI
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 60, "total_frames": 36293, "total_tasks": 1, "total_videos": 120, "total_chunks": 1,…
Published Dec 2025 · apache-2.0 · Johnbosco20
Lerobot Community Datasets v3 - A Cross-Embodiment Pretraining Dataset for Vision Language Action Models A large-scale robotics dataset for vision-language-action learning, featuring 791 datasets across 46 robot types, enabling…
Published Apr 2026 · apache-2.0 · d3d3shan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 40, "total_frames": 22683, "total_tasks": 1, "chunks_size": 1000,…
Published May 2025 · apache-2.0 · iantc104
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 200, "total_frames": 30000, "total_tasks": 1, "total_videos": 1200, "total_chunks": 1,…
Published Jan 2026 · apache-2.0 · AiSaurabhPatil
OpenArm Pick v6 - LeRobot Dataset A LeRobot v2.1 dataset for bimanual robot manipulation, recorded in Isaac Sim with teleoperation.
Published Mar 2026 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Important Notes: This is a G1 diversity dataset that can be used for video generation models, world models, and other applications [Lee et al., 2018].
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_Z1_Single", "total_episodes": 1265, "total_frames": 989335, "total_tasks": 1, "total_videos": 2530,…
Published Mar 2026 · apache-2.0 · villekuosmanen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5_follower", "total_episodes": 80, "total_frames": 24266, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · cloudwalk-research
Psi0 Apple-to-Plate VR Teleoperation Dataset Human-demonstrated loco-manipulation trajectories for fine-tuning the Psi0 Vision-Language-Action (VLA) model on a Unitree G1 humanoid robot in Isaac Lab simulation.
Published Mar 2026 · not specified · wintermelontree
From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning (DICE-RL) This repository contains the datasets used in the paper From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning.
Published Mar 2026 · apache-2.0 · lerobot-data-collection
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "openarms_follower", "total_episodes": 1200, "total_frames": 3254196, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · Pantheon-inc
teleop-piper-0411 Teleoperation dataset for the Agilex Piper robotic arm, collected April 11, 2026 across 3 rigs. Dual-camera (ego wrist + exo third-person), LeRobot v2 format.
Published Mar 2026 · mit · Lusmse
Capstone VLA Datasets LeRobot v2.1 datasets for a liquid pouring task on a KuavoV4Pro humanoid robot (34-DOF, bimanual). Used to benchmark GR00T N1.6, pi0.5, and Diffusion Policy under varying real-to-synthetic data ratios.
Published Apr 2026 · apache-2.0 · KentLuo
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "metaworld", "total_episodes": 2500, "total_frames": 204806, "total_tasks": 49, "chunks_size": 1000, "fps": 80, "splits":…
Published Apr 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 221, "total_frames": 638395, "total_tasks": 7, "chunks_size": 1000,…
Published May 2025 · apache-2.0 · cadene
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 95584, "total_frames": 27607757, "total_tasks": 49596, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 338, "total_frames": 364063, "total_tasks": 11, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · ClOBOT
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Inspire", "total_episodes": 0, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0,…
Published Apr 2026 · apache-2.0 · jayshim
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "total_videos": 0, "total_chunks": 2,…
Published Nov 2024 · not specified · Fanqi-Lin
Robotic Manipulation Datasets for Four Tasks This repository contains in-the-wild robotic manipulation datasets collected using UMI, and processed through a SLAM pipeline, as…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 65000, "total_tasks": 1, "total_videos": 150, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · shivakanthsujit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "aloha", "total_episodes": 10, "total_frames": 2568, "total_tasks": 1, "total_videos": 30, "total_chunks": 1,…
Published Apr 2025 · apache-2.0 · HuaihaiLyu
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 390, "total_frames": 219625, "total_tasks": 19, "total_videos": 1170, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · Xense
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_flexiv_rizon4_rt", "total_episodes": 35, "total_frames": 315631, "total_tasks": 1, "chunks_size": 1000,…
Published Aug 2025 · mit · chrisyrniu
Human2LocoMan: Learning Versatile Quadrupedal Manipulation with Human Pretraining. Yaru Niu, Yunzhe Zhang, Mingyang Yu, Changyi Lin, Chenhao Li, Yikai Wang, Yuxiang Yang, Wenhao Yu, Tingnan Zhang, Zhenzhen Li, Jonathan Francis…
Published Nov 2025 · cc-by-4.0 · SITL-Eng
Comprehensive Robotic Cholecystectomy Dataset (CRCD) The Comprehensive Robotic Cholecystectomy Dataset (CRCD) is a large-scale, multimodal dataset for robot-assisted surgery (RAS) research. It provides synchronized endoscopic videos, da…
Published Mar 2025 · cc-by-nc-sa-4.0 · MimicGen
DexMimicGen Datasets This repository contains the official dataset release of simulation environments and datasets for the ICRA 2025 paper "DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning".
Published Dec 2025 · not specified · BAAI-DataCube
agibot_task_463 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: gripper task: Clean the microwave oven and range hood. total_episodes: 1780 total_tasks: 1 size: 57G Dataset Structure ├── data │ └──…
Published Dec 2025 · not specified · BAAI-DataCube
agibot_task_537 This dataset converts the AgiBot format uniformly into LeRobot V3.0. Dataset Statistics robot_name: G1 end_effector: gripper task: Make the bed total_episodes: 751 total_tasks: 1 size: 64G Dataset Structure ├── data │ └── chunk-xxx │…
Published Feb 2026 · apache-2.0 · FedorX8
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "chunks_size": 1000, "fps": 10, "splits": {…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · JennyWWW
notes to self: this is using the faster teleop (aka no velocity limiting) This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lerobot_splatsim", "total_episodes": 1012,…
Published Jul 2025 · not specified · TeleEmbodied
HumanoidGen-Dataset This repository provides datasets and resources for the HumanoidGen framework, enabling bimanual dexterous manipulation research.
Published Jun 2025 · mit · arth-shukla
ManiSkill-HAB SetTable Dataset Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB SetTable.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 50, "total_frames": 34112, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…
Published Aug 2025 · apache-2.0 · saaduddinM
Language Table (LeRobot) — Task-Pruned, Reindexed Subset This release is a task-pruned subset of the original IPEC-COMMUNITY/language_table_lerobot.
Published Mar 2026 · cc-by-nc-4.0 · DynamicIntelligence
Dynamic Intelligence — Humanoid Robot Training Dataset A first-person (egocentric) video dataset of human hand manipulation, designed for training humanoid robot policies via imitation learning.
Published Mar 2026 · apache-2.0 · cupnb
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "ros2", "total_episodes": 357, "total_frames": 334477, "total_tasks": 4, "chunks_size": 1000, "data_files_size_in_mb":…
Published Apr 2026 · apache-2.0 · andreaskoepf
This dataset was created using LeRobot. Dataset Description Place Lego Duplo bricks in a container Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: place the duplo bricks in the container Episodes: 101 Total…
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_train_20000_25460_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,460 Frames: 185,428 Splits: train: 0:5460 Data Layout data_path :…
Published Apr 2026 · apache-2.0 · learner1119
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ffw_sh5", "total_episodes": 2, "total_frames": 184, "total_tasks": 1, "total_videos": 2, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 10, "total_frames": 6000, "total_tasks": 1, "total_videos": 40, "total_chunks": 1,…
Published Jul 2025 · mit · autobio-bench
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 106292, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · HecklesL
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": {…
Published May 2025 · apache-2.0 · Dongping-Li
EMMOE-100 Trainset Dataset Structure EMMOE-100/ ├── README.md ├── assets/ ├── data/ │ └── train/ │ ├── 1/ │ │ ├── info.txt │ │ ├── info_re1.txt │ │ ├──…
Published Sep 2025 · apache-2.0 · IliaLarchenko
This dataset is used for the demonstration of the Vision Language Action model finetuning. It is collected using a modified version of LeKiwi with 3 cameras, but technically, only the arm is used, so it can be treated as a dataset for…
Published Apr 2026 · apache-2.0 · unitreerobotics
Data Structure Observations observation.state.ee_state (12) End-effector states of the robot. Computed via forward kinematics (FK) from the root link to the left and right end-effectors. Includes the contribution of the waist.
Published Mar 2026 · apache-2.0 · robometer
RBM-1M-OOD evaluation dataset used in Robometer. It contains over 1k trajectories used for evaluation of general-purpose reward models.
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 222, "total_frames": 384490, "total_tasks": 1, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 80, "total_frames": 11522, "total_tasks": 1, "total_videos": 80, "total_chunks": 1,…
Published May 2025 · apache-2.0 · DAbraham
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so100", "total_episodes": 2, "total_frames": 1731, "total_tasks":1, "total_videos": 6, "total_chunks": 1, "chunks_size":…
Published Apr 2026 · apache-2.0 · rajeshramana
LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original?
Published Jun 2025 · not specified · fbeltrao
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 49, "total_frames": 23490, "total_tasks": 1, "total_videos": 98, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · apaszynska
merged_diffrenet_grips Dataset for RECAP (RL with Experience and Corrections via Advantage-conditioned Policies) training.
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "aloha", "total_episodes": 50, "total_frames": 75000, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Sep 2025 · cc-by-4.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 70, "total_frames": 1514, "total_tasks": 2, "total_videos": 70, "total_chunks": 1,…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_videos": 480, "total_chunks": 1,…
Published Oct 2025 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100", "total_episodes": 56, "total_frames": 22956, "total_tasks": 1, "chunks_size": 1000, "fps": 30, "splits": {…
Published Apr 2026 · apache-2.0 · himanshu9series
LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of LightwheelAI/leisaac-pick-orange ready for GR00T N1.6 fine-tuning. What's Different from the Original?
Published Jan 2026 · apache-2.0 · Juelg
Maniskill Sub-Dataset in RLDS Format used in RPD This repository contains the maniskill subset in RLDS format used to train Octo and OpenVLA in the paper Refined Policy Distillation: From VLA Generalists to RL Experts, which distilled…
Published Apr 2026 · apache-2.0 · binsabit
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 101, "total_frames": 21006, "total_tasks": 4, "chunks_size": 1000, "fps": 8, "splits": {…
Published Dec 2025 · apache-2.0 · LightwheelAI
This dataset was created using LeRobot. 1 Dataset Description This dataset includes 117 Lightwheel-Libero-Tasks and 89 Lightwheel-Robocasa-Tasks, collected using the x7s robot in environments provided by LW-BenchHub. The robot configuration…
Published Feb 2026 · apache-2.0 · tma-hiverobots
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Unitree_G1_Dex3", "total_episodes": 124, "total_frames": 13344, "total_tasks": 3, "chunks_size": 1000,…
Published Mar 2025 · apache-2.0 · IPEC-COMMUNITY
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "total_videos": 135, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · ywxia
This dataset was created using LeRobot. Dataset Description Data Distribution Overview This figure summarizes the data distribution of the ywxia/fold_combined_gt dataset, auto-generated after each conversion via…
Published Oct 2025 · cc-by-4.0 · oxe-auge
berkeley_autolab_ur5_train_400_500 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,479 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…
Published Apr 2026 · apache-2.0 · dream-79
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "omx_f", "total_episodes": 99, "total_frames": 14858, "total_tasks": 1, "total_videos": 198, "total_chunks": 1,…
Published Apr 2026 · cc-by-nc-4.0 · VadExylos
Exylos Pick & Place Sample Dataset Summary Exylos Pick & Place Sample is a compact multi-view robotics dataset in LeRobot-style structure for a single manipulation task: pick up an object and place it into a container.
Published Apr 2026 · apache-2.0 · villekuosmanen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "arx5", "total_episodes": 164, "total_frames": 34029, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Apr 2026 · apache-2.0 · buggybrain
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 51, "total_frames": 22577, "total_tasks": 1, "chunks_size": 1000,…
Published Jul 2025 · apache-2.0 · binhng
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 200, "total_frames": 33587, "total_tasks": 40, "total_videos": 1600, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · gaozj
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": null, "total_episodes": 212, "total_frames": 101703, "total_tasks": 2, "total_videos": 636, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 340, "total_frames": 289608, "total_tasks": 9, "chunks_size": 1000,…
Published Jan 2026 · apache-2.0 · ygtxr1997
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 104, "total_videos": 39350,…
Published Mar 2026 · apache-2.0 · apaszynska
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_dk1_follower", "total_episodes": 221, "total_frames": 203035, "total_tasks": 1, "chunks_size": 1000,…
Published Aug 2025 · mit · BeingBeyond
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos We introduce Being-H0, the first dexterous Vision-Language-Action model pretrained from large-scale human videos via explicit hand motion modeling.
Published Nov 2025 · cc-by-4.0 · oxe-auge
iamlab_cmu_pickup_insert_train_500_631_augmented Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, sawyer, ur5e, widowX, xarm7 FPS: 20.0 Episodes: 131 Frames: 30,143 Videos: 1,179 Chunks: 1 Splits:…
Published May 2026 · apache-2.0 · andreaskoepf
This dataset was created using LeRobot. Dataset Description Bimanual duplo disassembly demonstrations with DK1 robot Robot: TRLC DK1 bimanual (bi_dk1_follower) — 2× 6-DOF arms with grippers Task: separate the duplo bricks Episodes: 42…
Published Apr 2026 · apache-2.0 · trietlm0306
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 135, "total_frames": 65250, "total_tasks": 5, "chunks_size": 1000,…
Published Oct 2025 · cc-by-4.0 · oxe-auge
berkeley_autolab_ur5_train_300_400 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,934 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…
Published Apr 2026 · apache-2.0 · leosltl
Android Control (Community Mirror) This is a community mirror of the official Android Control dataset by Google Research, hosted on Hugging Face for easier access.
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_test_0_3475_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 3,475 Frames: 118,603 Splits: train: 0:3475 Data Layout data_path :…
Published Apr 2026 · apache-2.0 · yangxinye
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so_follower", "total_episodes": 1, "total_frames": 751, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2026 · apache-2.0 · mjung11
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks": 4, "chunks_size": 1000,…
Published Mar 2025 · apache-2.0 · AdilZtn
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": null, "total_episodes": 1860, "total_frames": 0, "total_tasks": 0, "total_videos": 0, "total_chunks": 0, "chunks_size":…
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_train_15000_20000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 169,127 Splits: train: 0:5000 Data Layout data_path :…
Published Apr 2026 · apache-2.0 · ByteRainx
Bright 2026 Benchmark - Robot 19153 (同济子豪兄, Zhang Zihao's team) Competition task: Fruit Sorting Dataset Format LeRobot v2.1 format with 3 camera views (face, left wrist, right wrist).
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_train_5000_10000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 168,737 Splits: train: 0:5000 Data Layout data_path :…
Published Apr 2026 · apache-2.0 · ry-5
FR5 Garlic Manipulation Dataset This dataset contains demonstrations collected on an FR5 robot teleoperated to manipulate garlic.
Published Mar 2026 · apache-2.0 · RonPlusSign
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "dianarm", "total_episodes": 26, "total_frames": 6847, "total_tasks": 1, "chunks_size": 1000, "data_files_size_in_mb":…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 110, "total_frames": 26113, "total_tasks": 216, "total_videos": 110, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "yam_follower", "total_episodes": 88, "total_frames": 287160, "total_tasks": 2, "chunks_size": 1000,…
Published Apr 2026 · cc-by-nc-nd-4.0 · yanglei18
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception Lei Yang · Xinyu Zhang · Jun Li · Chen Wang · Jiaqi Ma · Zhiying Song · Tong Zhao · Ziying Song · Li Wang · Mo Zhou · Yang Shen · Kai Wu · Chen Lv This is the…
Published Apr 2026 · apache-2.0 · edgarcancinoe
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 30, "total_frames": 27724, "total_tasks": 2, "chunks_size": 1000,…
Published Dec 2025 · apache-2.0 · zilch512
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101", "total_episodes": 9, "total_frames": 289, "total_tasks": 2, "total_videos": 0, "total_chunks": 1, "chunks_size":…
Published Mar 2026 · apache-2.0 · SUZ-tsinghua
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "aloha", "total_episodes": 500, "total_frames": 223728, "total_tasks": 343, "chunks_size": 1000, "data_files_size_in_mb":…
Published Mar 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 111, "total_frames": 62196, "total_tasks": 1, "chunks_size": 1000,…
Published Oct 2025 · apache-2.0 · JUNTAO123
🤖 Custom LeRobot Dataset This dataset was created using LeRobot and follows the LeRobot v2.1 dataset specification for robotic control and imitation learning.
Published Feb 2026 · apache-2.0 · twoeight
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 101, "total_frames": 247024, "total_tasks": 2, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 18, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · pmoller
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "ur5", "total_episodes": 100, "total_frames": 12696, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Zekai-Chen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_so_follower", "total_episodes": 303, "total_frames": 373511, "total_tasks": 1, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 107, "total_frames": 7622, "total_tasks": 1, "total_videos": 107, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · VLA-Arena
VLA-Arena Dataset (L0 - Small Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.
Published Dec 2025 · apache-2.0 · VLA-Arena
VLA-Arena Dataset (L1 - Medium Variant) About VLA-Arena VLA-Arena is an open-source benchmark designed for the systematic evaluation of Vision-Language-Action (VLA) models.
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · Factory-Intelligence
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_widowxai_follower_robot", "total_episodes": 128, "total_frames": 74668, "total_tasks": 1, "chunks_size": 1000,…
Published Jun 2025 · cc-by-4.0 · nvidia
Dataset Description: This dataset is a multimodal collection of trajectories generated in Isaac Lab. It supports humanoid (GR1) tabletop manipulation tasks for industrial settings.
Published Nov 2025 · cc-by-4.0 · oxe-auge
bridge_train_10000_15000_augmented Overview Codebase version: v3.0 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, ur5e, xarm7 FPS: 5 Episodes: 5,000 Frames: 170,583 Splits: train: 0:5000 Data Layout data_path :…
Published Jun 2025 · mit · arth-shukla
ManiSkill-HAB PrepareGroceries Dataset Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB PrepareGroceries.
Published Jul 2025 · mit · autobio-bench
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5e", "total_episodes": 100, "total_frames": 84994, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "Unitree_G1_Brainco", "total_episodes": 197, "total_frames": 220788, "total_tasks": 1, "total_videos": 788,…
Published Dec 2025 · mit · elonelonelon
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "R1Pro", "total_episodes": 10000, "total_frames": 119094660, "total_tasks": 50, "total_videos": 90000, "chunks_size":…
Published Oct 2025 · cc-by-4.0 · oxe-auge
berkeley_autolab_ur5_train_800_896 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 96 Frames: 9,360 Videos: 864 Chunks: 1 Splits: train: 0:96 Data…
Published Apr 2026 · apache-2.0 · n1robotics
This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo.
Published Feb 2026 · apache-2.0 · edgarcancinoe
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "so100_follower", "total_episodes": 240, "total_frames": 180027, "total_tasks": 1, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "franka", "total_episodes": 10977, "total_frames": 3114872, "total_tasks": 295, "chunks_size": 1000,…
Published Mar 2026 · apache-2.0 · correlllab
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": null, "total_episodes": 130, "total_frames": 4435, "total_tasks": 36, "chunks_size": 1000, "data_files_size_in_mb": 100,…
Published Apr 2026 · apache-2.0 · n1robotics
This dataset was created using LeRobot. Dataset Description Visualized using visualize-lerobot.py included in this dataset repo.
Published Apr 2026 · apache-2.0 · Elvinky
bi-so101-insert-screw-271ep Bimanual teleoperation dataset for the task "Insert the copper screw into the black sleeve", collected with SO-101 dual arms and LeRobot 0.5.0.
Published Jun 2025 · apache-2.0 · fbeltrao
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "so101_follower", "total_episodes": 59, "total_frames": 20844, "total_tasks": 2, "total_videos": 118, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · arif101
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "panda", "total_episodes": 272, "total_frames": 11334, "total_tasks": 32, "total_videos": 0, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · yangxinye
real_so101_record_v1 A real-world LeRobot-format dataset collected on an SO101/SO100-style follower arm with three camera views.
Published Apr 2026 · apache-2.0 · villekuosmanen
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "arx5", "total_episodes": 60, "total_frames": 41832, "total_tasks": 1, "total_videos": 120, "total_chunks": 1,…
Published Apr 2026 · apache-2.0 · TrossenRoboticsCommunity
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_mobile", "total_episodes": 48, "total_frames": 28479, "total_tasks": 1,…
Published Oct 2025 · cc-by-4.0 · oxe-auge
berkeley_autolab_ur5_train_0_100 Overview Codebase version: v2.1 Robots: google_robot, images, jaco, kinova3, kuka_iiwa, panda, sawyer, widowX, xarm7 FPS: 5.0 Episodes: 100 Frames: 9,863 Videos: 900 Chunks: 1 Splits: train: 0:100 Data…
Published Nov 2025 · apache-2.0 · DRMNmadhan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 100, "total_frames": 75051, "total_tasks": 1, "total_videos": 200, "total_chunks": 1,…
Published Mar 2026 · apache-2.0 · Mimic-Robotics
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "mimic_follower", "total_episodes": 678, "total_frames": 326999, "total_tasks": 9, "chunks_size": 1000,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 100, "total_frames": 12971, "total_tasks": 1, "total_videos": 100, "total_chunks": 1,…
Published Sep 2025 · mit · lerobot
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 104, "total_frames": 8928, "total_tasks": 14, "total_videos": 104, "total_chunks": 1,…
Published Sep 2025 · apache-2.0 · unitreerobotics
This dataset was created using LeRobot. Due to the inability to precisely describe spatial positions, adjust the scene to closely match the first frame of the dataset after installing the hardware as specified in Part 5 of AVP…
Published Mar 2026 · apache-2.0 · mjung11
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "bi_piper_follower", "total_episodes": 853, "total_frames": 851784, "total_tasks": 4, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · michios
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "Franka", "total_episodes": 80, "total_frames": 33909, "total_tasks": 26, "chunks_size": 1000, "data_files_size_in_mb":…
Published Apr 2026 · apache-2.0 · allday-technology
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "trossen_subversion": "v1.0", "robot_type": "trossen_ai_stationary", "total_episodes": 1, "total_frames": 293, "total_tasks": 1,…
Published Apr 2026 · apache-2.0 · wiscohumanoids
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "lekiwi_client", "total_episodes": 226, "total_frames": 120729, "total_tasks": 3, "chunks_size": 1000,…
Published Apr 2026 · apache-2.0 · DRMNmadhan
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "piper", "total_episodes": 70, "total_frames": 42041, "total_tasks": 1, "total_videos": 210, "total_chunks": 1,…
Published Feb 2026 · apache-2.0 · FedorX8
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605, "chunks_size": 1000, "fps": 10,…
Published Jan 2026 · cc-by-4.0 · BeingBeyond
This data is part of the training data for Being-H0.5, produced by BeingBeyond. License This dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
566 remaining
KEEP DIGGING
A dataset record is only useful when it connects to the rest of the buyer workflow. The next review step is usually not another summary but a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.
For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.
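As a rough illustration of what a first-pass fit check could look like, the sketch below reads a few of the meta/info.json fields that are visible in the record excerpts listed above (robot_type, total_episodes, total_frames). The function name first_pass_fit, the FitReport type, and the min_episodes threshold are hypothetical and not part of LeRobot or truelabel. Because the excerpts are truncated, only the visible fields are used. This is a coarse screen, not a substitute for rights review or sample QA.

```python
from dataclasses import dataclass


@dataclass
class FitReport:
    embodiment_match: bool
    enough_episodes: bool
    mean_frames_per_episode: float


def first_pass_fit(info: dict, target_robot: str, min_episodes: int) -> FitReport:
    """Coarse screen over a parsed meta/info.json dict (hypothetical helper)."""
    episodes = info["total_episodes"]
    frames = info["total_frames"]
    return FitReport(
        # Case-insensitive compare: the listing shows both "Franka" and "franka".
        embodiment_match=info["robot_type"].lower() == target_robot.lower(),
        enough_episodes=episodes >= min_episodes,
        # Guard against empty records to avoid division by zero.
        mean_frames_per_episode=frames / episodes if episodes else 0.0,
    )


# Fields copied from one truncated excerpt in the listing; the rest of the
# file is elided in the source and is deliberately not reconstructed here.
record = {
    "codebase_version": "v3.0",
    "robot_type": "mimic_follower",
    "total_episodes": 678,
    "total_frames": 326999,
    "total_tasks": 9,
}

# min_episodes=500 is an arbitrary example threshold, not a recommendation.
report = first_pass_fit(record, target_robot="mimic_follower", min_episodes=500)
print(report)
```

A record that passes this screen still needs the adjacent checks named above. One that fails on embodiment or episode count is a candidate for comparison against other records or for a custom collection spec.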
Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.
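The routing rule in the paragraph above can be sketched as a small decision function. The names Step, COLLECTION_BLOCKERS, and next_step are hypothetical, and the precedence (a listed blocker overrides everything else) is an assumption; the text does not state how the three cases are ordered when more than one applies.

```python
from enum import Enum
from typing import Iterable


class Step(Enum):
    BROAD_INDEX = "broad index"
    PROFILE_AND_COMPARISON = "profile and comparison pages"
    CUSTOM_COLLECTION = "custom collection"


# Blocker categories taken from the paragraph above.
COLLECTION_BLOCKERS = frozenset(
    {"rights", "consent", "geography", "robot embodiment", "target environment coverage"}
)


def next_step(candidate_known: bool, blockers: Iterable[str]) -> Step:
    """Pick the next review surface (assumed precedence: blocker first)."""
    if COLLECTION_BLOCKERS & set(blockers):
        return Step.CUSTOM_COLLECTION
    if candidate_known:
        return Step.PROFILE_AND_COMPARISON
    # Discovery is still incomplete, so stay on the breadth layer.
    return Step.BROAD_INDEX


print(next_step(candidate_known=True, blockers=["consent"]).value)
```

Blockers outside the listed categories (for example, a file-format mismatch) do not trigger custom collection in this sketch; they fall through to the profile or index step.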
TRUELABEL ROUTING
Turn the Hugging Face record into a buyer-ready request with sample QA, rights review, conversion proof, and deployment-fit checks.