HF AUTHOR INDEX

Hugging Face robotics datasets by author

The 1,001 robotics-tagged HF records ship from 38 authors with three or more datasets each. Author clusters are the canonical destination for buyer queries like "Stanford robotics datasets" or "NVIDIA physical AI datasets" that aren't well-served by individual record pages.

DIRECT ANSWER

Each cluster aggregates every record from one author — including ones folded in because the standalone record is too thin to be useful on its own. Clusters surface the author’s license posture, total downloads, top modalities, and any arxiv-cited research records. 667 records across 38 authors.

30 datasets · 804,037 downloads

IPEC-COMMUNITY

Top license: apache-2.0 · arxiv-cited research

Video
Tabular
Timeseries

6 datasets · 659,048 downloads

cadene

Top license: apache-2.0 · arxiv-cited research

Video
Tabular
Text

13 datasets · 463,897 downloads

nvidia

Top license: cc-by-4.0 · arxiv-cited research

Video
Tabular
Image

10 datasets · 325,043 downloads

InternRobotics

Top license: not specified · arxiv-cited research

Text
Image
3d

99 datasets · 313,775 downloads

lerobot

Top license: mit · arxiv-cited research

Timeseries
Tabular
Video

326 datasets · 309,368 downloads

RoboCOIN

Top license: apache-2.0 · arxiv-cited research

Tabular
Timeseries
Video

3 datasets · 127,962 downloads

agibot-world

Top license: not specified

Text
Image

14 datasets · 66,470 downloads

lerobot-data-collection

Top license: apache-2.0

Tabular
Timeseries
Video

4 datasets · 36,775 downloads

HuggingFaceVLA

Top license: apache-2.0 · arxiv-cited research

Video
Image
Timeseries

37 datasets · 35,113 downloads

unitreerobotics

Top license: apache-2.0 · arxiv-cited research

Video
Tabular
Timeseries

4 datasets · 28,789 downloads

jnogga

Top license: cc-by-4.0

4 datasets · 23,195 downloads

Traly

Top license: apache-2.0 · arxiv-cited research

Tabular
Timeseries
Video

3 datasets · 14,785 downloads

OpenDriveLab-org

Top license: cc-by-nc-sa-4.0 · arxiv-cited research

Tabular
Timeseries
Video

3 datasets · 14,632 downloads

allenai

Top license: cc-by-4.0 · arxiv-cited research

Image
Text
Timeseries

4 datasets · 12,967 downloads

FedorX8

Top license: apache-2.0 · arxiv-cited research

Video
Tabular
Timeseries

3 datasets · 11,205 downloads

leggedrobotics

Top license: mit · arxiv-cited research

Image
Text

9 datasets · 11,127 downloads

ByteRainx

Top license: apache-2.0

Video

3 datasets · 10,231 downloads

fleaven

Top license: cc-by-4.0

3 datasets · 9,175 downloads

spatialverse

Top license: cc-by-nc-4.0 · arxiv-cited research

Image
3d

3 datasets · 9,132 downloads

shihao1895

Top license: mit · arxiv-cited research

10 datasets · 8,784 downloads

Factory-Intelligence

Top license: apache-2.0

Tabular
Timeseries
Video

3 datasets · 8,484 downloads

Joocjun

Top license: apache-2.0

Tabular
Video

4 datasets · 6,235 downloads

haosulab

Top license: apache-2.0 · arxiv-cited research

4 datasets · 5,769 downloads

ygtxr1997

Top license: apache-2.0

Video

3 datasets · 5,156 downloads

open-world-agents

Top license: cc-by-nc-4.0 · arxiv-cited research

Video

4 datasets · 4,315 downloads

Gongsta

Top license: apache-2.0

Tabular
Timeseries
Video

11 datasets · 4,081 downloads

oxe-auge

Top license: cc-by-4.0 · arxiv-cited research

Tabular
Text
Timeseries

7 datasets · 3,143 downloads

villekuosmanen

Top license: apache-2.0

Tabular
Timeseries
Video

8 datasets · 3,141 downloads

Mimic-Robotics

Top license: apache-2.0

Tabular
Timeseries
Video

5 datasets · 2,975 downloads

BeingBeyond

Top license: cc-by-4.0 · arxiv-cited research

Tabular
Timeseries
Video

5 datasets · 2,769 downloads

andreaskoepf

Top license: apache-2.0

Tabular
Timeseries
Video

3 datasets · 2,398 downloads

SUZ-tsinghua

Top license: apache-2.0

Image
Timeseries

3 datasets · 2,267 downloads

VLA-Arena

Top license: apache-2.0

Image
Timeseries

4 datasets · 1,943 downloads

DRMNmadhan

Top license: apache-2.0

Video
Tabular
Timeseries

3 datasets · 1,617 downloads

arth-shukla

Top license: mit · arxiv-cited research

3 datasets · 1,542 downloads

TESS-Computer

Top license: mit · arxiv-cited research

Text
Tabular

3 datasets · 1,240 downloads

autobio-bench

Top license: mit

Tabular
Timeseries
Video

3 datasets · 1,185 downloads

fbeltrao

Top license: apache-2.0

Tabular
Timeseries
Video

A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.

For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.

Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.

TRUELABEL ROUTING

Don't see an author you expected?

The watchlist updates from upstream HF metadata. Authors with fewer than three robotics-tagged records appear under their individual dataset pages but don't get their own cluster.

See the full Hugging Face watchlist

Hugging Face robotics datasets by author

Authors with three or more robotics datasets

IPEC-COMMUNITY

cadene

nvidia

InternRobotics

lerobot

RoboCOIN

agibot-world

lerobot-data-collection

HuggingFaceVLA

unitreerobotics

jnogga

Traly

OpenDriveLab-org

allenai

FedorX8

leggedrobotics

ByteRainx

fleaven

spatialverse

shihao1895

Factory-Intelligence

Joocjun

haosulab

ygtxr1997

open-world-agents

Gongsta

oxe-auge

villekuosmanen

Mimic-Robotics

BeingBeyond

andreaskoepf

SUZ-tsinghua

VLA-Arena

DRMNmadhan

arth-shukla

TESS-Computer

autobio-bench

fbeltrao

Use this record as part of a broader dataset review

Where to go next

Other places to verify the claims

Don't see an author you expected?