HF AUTHOR CLUSTER

InternRobotics robotics datasets

10 robotics-tagged HF records from InternRobotics, totaling 325,043 cumulative downloads. Some records cite published arxiv research.

DIRECT ANSWER

Author clusters consolidate every record from one publisher into a single buyer-review surface. InternRobotics ships 10 robotics datasets on Hugging Face. Top license: not specified. Of those, 9 get a full standalone page, 0 get a shorter profile, and 1 are folded into this cluster.

Robotics-tagged

Records

Hub signal

Cumulative downloads

325,043

First-pass rights

Top license

not specified

License

Modality

Format

10 of 10 datasets

OmniWorld

Published Apr 2026 · cc-by-nc-sa-4.0 · InternRobotics

[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling 🎉NEWS [2026.3.21] 🔥 OmniWorld-Game with Metric Scale is now released!

200,677 downloads
90 likes
1B<n<10B
Paper available

Image
Text
Webdataset

InternData-N1

Published Feb 2026 · cc-by-sa-4.0 · InternRobotics

InternData-N1 InternData-N1 is a large-scale, unified vision-language navigation dataset consolidating multiple benchmarks into a standardized format.

47,044 downloads
68 likes
n>1T
Gated access

InternData-A1

Published Mar 2026 · not specified · InternRobotics

InternData-A1 InternData-A1 is a hybrid synthetic-real manipulation dataset containing over 630k trajectories and 7,433 hours across 4 embodiments, 18 skills, 70 tasks, and 227 scenes, covering rigid, articulated, deformable, and…

36,451 downloads
88 likes
n>1T
Paper available
Gated access

3d
Image

InternScenes

Published Feb 2026 · not specified · InternRobotics

InternScenes InternScenes is a large-scale interactive indoor scene dataset with realistic layouts. This dataset comprises approximately 40,000 diverse scenes and 1.96M 3D objects that cover 15 common scene types and 288 object classes,…

16,673 downloads
36 likes
1M<n<10M
Gated access

3d
Image
Webdataset

Sim1_Dataset

Published Apr 2026 · apache-2.0 · InternRobotics

Sim1_Dataset LeRobot-format manipulation dataset for cloth folding / deformable object tasks. Format This repository follows a LeRobot-style layout per subset: data/chunk-xxx/episode_XXXXXX.parquet: frame-level trajectories…

8,119 downloads
5 likes
1M<n<10M
Paper available
Gated access

Tabular
Timeseries

InternData-M1

Published Dec 2025 · not specified · InternRobotics

InternData-M1 InternData-M1 is a comprehensive embodied robotics dataset containing 244K simulation demonstrations with rich frame-based information including 2D/3D boxes, trajectories, grasp points, and semantic masks, with comprehensive…

6,154 downloads
30 likes
1M<n<10M
Gated access

Text
Webdataset

RoboInter-Data

Published Feb 2026 · not specified · InternRobotics

RoboInter-Data: Intermediate Representation Annotations for Robot Manipulation Rich, dense, per-frame intermediate representation annotations for robot manipulation, built on top of DROID and RH20T.

5,682 downloads
9 likes
1K<n<10K
Paper available

Text
Video
JSON

MesaTask-10K

Published Sep 2025 · not specified · InternRobotics

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning 🔑Key Features MesaTask-10K, a large-scale dataset for task-oriented tabletop scene generation, comprises approximately 10,700 synthetic tabletop scenes…

1,842 downloads
15 likes
10K<n<100K
Paper available
Gated access

RoboInter-VQA

Published Feb 2026 · not specified · InternRobotics

RobotInter-VQA: Intermediate Representation Understanding & Generation VQA Dataset for Manipulation English | 简体中文 A Visual Question Answering dataset for robotic manipulation, developed as part of the RoboInter, covering generation,…

1,429 downloads
6 likes
not specified
Paper available

SynthVerse

Published Mar 2026 · cc-by-nc-sa-4.0 · InternRobotics

Viewer is explicitly configured to read the parquet split only. Citation If you find this dataset useful, please cite: @article{zhao2026SythnVerse, title={SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking},…

972 downloads
13 likes
1K<n<10K
Paper available

Image
Text
Optimized Parquet

A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.

For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.

Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.

TRUELABEL ROUTING

Need data like InternRobotics ships, but with cleaner rights?

If the Hub records don't carry the license, consent, or deployment fit your team needs, commission a custom collection on the same modality with explicit commercial terms.

Request similar data

InternRobotics robotics datasets

Records

Cumulative downloads

Top license

All 10 robotics records from InternRobotics

OmniWorld

InternData-N1

InternData-A1

InternScenes

Sim1_Dataset

InternData-M1

RoboInter-Data

MesaTask-10K

RoboInter-VQA

SynthVerse

Use this record as part of a broader dataset review

Where to go next

Other places to verify the claims

Need data like InternRobotics ships, but with cleaner rights?