truelabel

COMMERCIAL USE

Commercial use unclear

Commercial use status is a conservative truelabel interpretation for buyer triage. Legal teams still need to review source terms, contributor consent, and downstream model use rights.

DIRECT ANSWER

Dataset license text alone rarely answers whether a physical AI team can train, evaluate, redistribute, or commercialize a model. This grouping organizes datasets by truelabel’s buyer-readiness risk signal.

custom

Open X-Embodiment

A large cross-institution collection of robot demonstrations spanning many embodiments and manipulation tasks.

  • Unclear
  • Unknown
  • RGB-D

custom

DROID

A real-world robot manipulation dataset focused on diverse teleoperated demonstrations outside narrow lab-only settings.

  • Unclear
  • Unknown
  • Teleoperation

custom

BridgeData V2

A robot manipulation dataset from Berkeley focused on real-world behavior cloning and task generalization.

  • Unclear
  • Unknown
  • RGB-D

custom

RT-1

A robotics transformer data release associated with language-conditioned robot manipulation research.

  • Unclear
  • Unknown
  • RGB-D

custom

ALOHA

A low-cost bimanual teleoperation platform and dataset family used for imitation learning in dexterous manipulation.

  • Unclear
  • Medium
  • Teleoperation

custom

RoboNet

A multi-robot dataset for visual foresight and manipulation policy research.

  • Unclear
  • Low
  • RGB-D

custom

RLBench

A simulated robot learning benchmark with many manipulation tasks in CoppeliaSim.

  • Unclear
  • Low
  • RGB-D

custom

CALVIN

A benchmark for language-conditioned long-horizon robot manipulation in simulated environments.

  • Unclear
  • Low
  • RGB-D

custom

HOI4D

A 4D egocentric human-object interaction dataset with RGB-D and pose-oriented annotations.

  • Unclear
  • Medium
  • Egocentric video

custom

DexYCB

A dexterous hand-object interaction dataset centered on grasping YCB objects with 3D annotations.

  • Unclear
  • Medium
  • RGB-D

custom

Habitat datasets

A family of embodied AI datasets and simulation assets for navigation and rearrangement research.

  • Unclear
  • Low
  • RGB-D

custom

BEHAVIOR

A benchmark for household activities and embodied AI tasks in simulation.

  • Unclear
  • Low
  • RGB-D

custom

ObjectFolder

A dataset family for object-centric physical properties, geometry, and multimodal perception research.

  • Unclear
  • Low
  • RGB-D

custom

BC-Z

A behavior cloning project focused on zero-shot task generalization for robots.

  • Unclear
  • Unknown
  • RGB-D

custom

DexMV

A dexterous manipulation dataset focused on multi-view visual observations and hand-object interaction.

  • Unclear
  • Medium
  • RGB-D

custom

RH20T

A real-world contact-rich robot manipulation dataset with multimodal sensing, force, audio, and human demonstration video.

  • Unclear
  • Medium
  • Teleoperation

custom

AgiBot World

A large-scale real-world robot manipulation dataset family for fine-grained manipulation, tool use, and multi-robot collaboration.

  • Unclear
  • Unknown
  • Teleoperation

custom

RoboCasa

A large-scale kitchen simulation framework and dataset family for everyday manipulation tasks in diverse household environments.

  • Unclear
  • Low
  • RGB-D

custom

LIBERO

A benchmark suite for lifelong robot learning and language-conditioned manipulation tasks.

  • Unclear
  • Low
  • RGB-D

custom

RoboSet

A real-world multi-task kitchen manipulation dataset with teleoperated and kinesthetic demonstrations.

  • Unclear
  • Medium
  • Teleoperation

custom

RoboTurk

A large-scale teleoperation data collection platform and dataset family for robot manipulation tasks.

  • Unclear
  • Medium
  • Teleoperation

custom

UMI

Universal Manipulation Interface is an in-the-wild human demonstration framework for transferring portable gripper data to robot policies.

  • Unclear
  • Medium
  • Egocentric video

custom

FurnitureBench

A real-world long-horizon furniture assembly benchmark with successful demonstration data.

  • Unclear
  • Low
  • Teleoperation

custom

TACO Play

A kitchen robot manipulation dataset with Franka arm interaction data available through TensorFlow Datasets.

  • Unclear
  • Unknown
  • RGB-D

custom

LeRobot datasets

A Hugging Face robotics dataset ecosystem and standardized dataset format for multimodal robot learning data.

  • Unclear
  • Unknown
  • Teleoperation

FACET REVIEW PATHS

Do not treat this tag as the whole sourcing decision

Facet groupings are discovery aids, not final recommendations. A shared modality, task, robot, format, license, or commercial-use label only says that datasets are worth comparing; it does not prove that the source is safe, complete, or useful for a target model.

Use this grouping to shortlist candidates, then open the dataset profiles, run fit and license checks, and compare sources against the buyer's target environment. Thin tag results become useful only when they route the reader into deeper evidence and action surfaces.

The external references below keep the facet grounded in robotics data practice. They help reviewers understand why format, embodiment, trajectory quality, licensing, and real-world coverage matter before a team commits engineering time to ingestion.

When a facet has only a few matching datasets, treat that as a signal rather than a weakness. It may mean the public corpus is thin for that robot, task, or format, and the next move is a custom supplement with the facet written into acceptance criteria.

INTERNAL LINKS

Continue the buyer workflow

EXTERNAL REFERENCES

Source context to verify