Records
13
HF AUTHOR CLUSTER
13 robotics-tagged HF records from nvidia, totaling 463,897 cumulative downloads. Some records cite published arxiv research.
DIRECT ANSWER
Author clusters consolidate every record from one publisher into a single buyer-review surface. nvidia ships 13 robotics datasets on Hugging Face. Top license: cc-by-4.0. Of those, 13 get a full standalone page, 0 get a shorter profile, and 0 are folded into this cluster.
13
463,897
cc-by-4.0
DATASETS
13 of 13 datasets
Published Mar 2026 · cc-by-4.0 · nvidia
PhysicalAI-Robotics-GR00T-X-Embodiment-Sim Github Repo: Isaac GR00T N1 We provide a set of datasets used for post-training of GR00T N1. Each dataset is a collection of trajectories from different robot embodiments and tasks.
Published Apr 2026 · cc-by-4.0 · nvidia
Dataset Description: Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate AI autonomy models for surgical robotics and ultrasound.
Published Mar 2026 · cc-by-4.0 · nvidia
PhysicalAI-Robotics-Manipulation-Kitchen-Demos We provide a 600 hours of human-teleoperated demonstrations across 316 different tasks, totalling 55k trajectories.
Published Jun 2025 · cc-by-4.0 · nvidia
PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams Paper | Paper Website | GitHub Download We provide a download script to download our dataset. If you have enough space, you can use git to download a dataset from huggingface.
Published May 2025 · cc-by-4.0 · nvidia
Dataset Description: PhysicalAI-Robotics-Manipulation-SingeArm is a collection of datasets of automatic generated motions of a Franka Panda robot performing operations such as block stacking, opening cabinets and drawers.
Published Dec 2025 · cc-by-4.0 · nvidia
NVIDIA Physical AI SimReady Warehouse OpenUSD Dataset Dataset Version: 1.1.0 Date: May 18, 2025 Author: NVIDIA, Corporation License: CC-BY-4.0 (Creative Commons Attribution 4.0 International) Contents This dataset includes the following:…
Published May 2025 · cc-by-4.0 · nvidia
PhysicalAI Robotics Manipulation in the Kitchen Dataset Description: PhysicalAI-Robotics-Manipulation-Kitchen is a dataset of automatic generated motions of robots performing operations such as opening and closing cabinets, drawers,…
Published Jan 2026 · cc-by-nc-4.0 · nvidia
Website | Model | Dataset | Code | Paper NitroGen Dataset Dataset Description: The NitroGen dataset contains action annotations for publicly available gameplay videos.
Published Jun 2025 · cc-by-4.0 · nvidia
GraspGen: Scaling Sim2Real Grasping GraspGen is a large-scale simulated grasp dataset for multiple robot embodiments and grippers.
Published Apr 2026 · cc-by-4.0 · nvidia
Dataset Description The Physical AI NuRec dataset seeks to empower robotic researchers to build the next generation of physical AI based end-to-end robotic models.
Published May 2025 · not specified · nvidia
PhysicalAI-Robotics-Manipulation-Objects is a dataset of automatic generated motions of robots performing operations such as picking and placing objects in a kitchen environment.
Published Dec 2025 · cc-by-4.0 · nvidia
Dataset Description: The Arena-G1-Loco-Manipulation-Task dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (G1) loco-manipulation task in IsaacLab-Arena environment.
Published Jun 2025 · cc-by-4.0 · nvidia
Dataset Description: This dataset is multimodal collections of trajectories generated in Isaac Lab. It supports humanoid (GR1) tabletop manipulation tasks for industrial settings.
KEEP DIGGING
A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.
For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.
Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.
TRUELABEL ROUTING
If the Hub records don't carry the license, consent, or deployment fit your team needs, commission a custom collection on the same modality with explicit commercial terms.