truelabelRequest data

Dataset profile

ad1t7a/10Kh-RealOmin-OpenData: Large-Scale Real-World Dual-Arm Robotics Dataset

The 10Kh-RealOmin-OpenData dataset delivers over 10,000 cumulative hours and more than one million video clips of dual-arm robotic manipulation collected from 3,000+ real households. Released under CC-BY-SA-4.0, it provides video modality data across nearly 10,000 distinct fine-grained manipulation targets. Robotics teams use this dataset to train vision-language-action models and world models for in-home service robots because the authentic household scenarios and skill diversity support robust generalization across deployment environments.

Updated 2026-06-29
By Truelabel Team
Reviewed by Truelabel Team ·
dual-arm robotics dataset

Quick facts

Scale
10,000+ hours, 1M+ clips
License
CC-BY-SA-4.0
Modality
Video
Embodiment
Dual-arm robots
Commercial use
Permitted with share-alike
Environments
3,000+ real households

Dataset composition and collection methodology

The 10Kh-RealOmin-OpenData dataset aggregates video recordings from dual-arm robotic systems operating in over 3,000 distinct household environments. Each manipulation skill is supported by substantial sample volume, ensuring that models trained on this data encounter sufficient variation in lighting, clutter, object placement, and household layout. The collection spans nearly 10,000 fine-grained manipulation targets, avoiding the simple repetition patterns that plague smaller or lab-constrained datasets. Video clips capture end-to-end task execution, preserving the temporal dynamics required for reinforcement learning and sequence modeling. The dataset creators prioritized authentic scenarios over curated lab setups, meaning teams will find realistic occlusions, variable object states, and the kinds of edge cases that emerge during actual in-home deployment. This design choice makes the dataset particularly valuable for teams building service robots that must generalize beyond controlled test benches.

Licensing terms and commercial deployment

Released under the Creative Commons Attribution-ShareAlike 4.0 International license, this dataset permits commercial use provided that derivative works carry the same CC-BY-SA-4.0 terms and include proper attribution. For robotics companies training proprietary VLA models, the share-alike clause means that if you distribute a model trained on this data, the model weights or training pipeline must also be shared under CC-BY-SA-4.0. Teams integrating this dataset into closed-loop training workflows should evaluate whether the share-alike obligation aligns with their intellectual-property strategy. The license does allow internal research, prototyping, and service-robot deployments where the trained model remains internal and is not redistributed. Legal and procurement teams should confirm that downstream model-release plans comply with the share-alike requirement, especially if the roadmap includes offering pre-trained manipulation models as commercial products.

Procurement and integration through Truelabel

Organizations sourcing this dataset through Truelabel gain structured metadata, compliance documentation, and integration support that accelerates procurement cycles. Truelabel provides license summaries, sample inspection tooling, and provenance records that satisfy internal review requirements in regulated industries. For teams managing multi-dataset pipelines, Truelabel's unified API reduces the engineering overhead of handling Hugging Face downloads, storage, and versioning. The platform also surfaces usage analytics, helping ML teams track which subsets contribute most to model performance and justify continued investment. Because this dataset exceeds one terabyte, Truelabel's bandwidth optimization and partial-download capabilities can significantly reduce initial download time and cloud egress costs, particularly for teams running distributed training across multiple regions.

Known limitations and deployment considerations

While the dataset's scale and diversity are industry-leading, teams should note that all clips originate from household environments, which may limit direct transferability to warehouse, hospital, or outdoor agricultural settings. The dual-arm embodiment assumption means that single-arm or mobile-manipulator architectures will require domain adaptation or augmentation with complementary datasets. Video-only modality implies that tactile, force-torque, or proprioceptive signals are absent; teams building contact-rich manipulation policies may need to fuse this data with smaller tactile datasets or simulate missing modalities. The dataset description does not specify frame rate, resolution, or camera intrinsics, so teams should inspect sample clips early in the procurement process to confirm compatibility with existing data pipelines. Finally, because data was collected from real households, privacy and consent documentation should be reviewed to ensure compliance with regional data-protection regulations if the trained models will be deployed in jurisdictions with strict AI-training-data rules.

Use these to move from category-level context into specific task, dataset, format, and comparison detail.

FAQ

What exactly is the 10Kh-RealOmin-OpenData dataset and who publishes it?

The 10Kh-RealOmin-OpenData dataset is a large-scale collection of over 10,000 hours and more than one million video clips documenting dual-arm robotic manipulation tasks in real household environments. Published by ad1t7a on Hugging Face, it is presented as the largest open-source embodied intelligence dataset available, with data gathered from over 3,000 distinct homes and nearly 10,000 fine-grained manipulation targets. The dataset is designed to support training of vision-language-action models, world models, and reinforcement-learning agents that must generalize across diverse in-home service scenarios.

What are the exact licensing terms and can I use this dataset commercially?

This dataset is released under the Creative Commons Attribution-ShareAlike 4.0 International license, which permits commercial use with two key obligations: you must provide appropriate attribution to the original dataset creators, and any derivative works—including trained models or modified datasets—must be shared under the same CC-BY-SA-4.0 terms. For robotics companies, this means that if you train a VLA model on this data and then distribute that model publicly or commercially, the model weights and associated training artifacts must also carry the CC-BY-SA-4.0 license. Internal use, research prototypes, and service deployments where the model is not redistributed are generally permissible, but legal review is recommended to ensure compliance with your organization's IP strategy.

Which robotics teams should prioritize this dataset for their training pipelines?

Teams building vision-language-action models or world models for in-home service robots will find this dataset especially valuable because of its scale, diversity, and authentic household scenarios. The 3,000+ home environments and nearly 10,000 manipulation targets provide the variation needed to train models that generalize beyond lab-controlled settings. Dual-arm manipulation researchers benefit directly from the embodiment match, while reinforcement-learning practitioners gain access to long-horizon task sequences captured in video. Organizations with share-alike-compatible licensing strategies and sufficient infrastructure to handle terabyte-scale video datasets should prioritize this resource. Teams targeting warehouse automation, outdoor agriculture, or single-arm mobile manipulators may need to augment with domain-specific data to achieve comparable performance.

When is this dataset NOT the right choice for a robotics project?

This dataset is not ideal if your deployment environment differs significantly from residential households—warehouse, hospital, or outdoor agricultural settings will exhibit different object distributions, lighting, and spatial layouts that are underrepresented here. Single-arm or mobile-manipulator teams will need to invest in domain adaptation since all clips assume a dual-arm embodiment. If your policy requires tactile feedback, force-torque sensing, or proprioceptive signals, the video-only modality will necessitate fusion with additional sensor datasets or simulation. Finally, organizations with strict proprietary-model requirements may find the CC-BY-SA-4.0 share-alike clause incompatible with their IP strategy, especially if the roadmap includes commercial distribution of pre-trained manipulation models.

How should procurement teams handle the dataset's terabyte-scale size during evaluation?

Given that this dataset exceeds one terabyte, procurement and ML engineering teams should plan for substantial download time, storage capacity, and bandwidth costs, particularly when moving data into cloud training environments. Inspecting a representative sample subset before committing to a full download is critical to confirm frame rate, resolution, camera angles, and clip length align with existing data pipelines. Platforms like Truelabel offer partial-download and streaming capabilities that reduce upfront infrastructure burden and allow iterative exploration. Teams should also budget for data versioning and backup, since re-downloading the full dataset after an accidental deletion can incur significant time and cost penalties.

What metadata or documentation should I expect when integrating this dataset?

The dataset's Hugging Face page provides a high-level description emphasizing scale, household diversity, and dual-arm embodiment, but detailed technical specifications—such as frame rate, resolution, camera intrinsics, action-label schemas, or task taxonomies—are not exhaustively documented in the truncated description available. Teams should inspect sample clips early to reverse-engineer these parameters and confirm compatibility with training frameworks. If you source through Truelabel, you gain access to structured metadata exports, compliance summaries, and provenance records that streamline internal review and satisfy audit requirements. Always verify that privacy and consent documentation for the 3,000+ household environments meets your jurisdiction's data-protection regulations before deploying models trained on this data in production.

Need data like ad1t7a/10Kh-RealOmin-OpenData: Large-Scale Real-World Dual-Arm Robotics Dataset?

If your project needs similar modality, scale, or licensing, truelabel can surface comparable open datasets or match you with capture partners that deliver to spec.

Access dataset on Hugging Face