truelabel

HF AUTHOR INDEX

Hugging Face robotics datasets by author

The index covers 1,001 robotics-tagged HF records. Thirty-eight authors publish three or more datasets each, and their 667 records are grouped into the clusters below. Author clusters are the canonical destination for buyer queries like "Stanford robotics datasets" or "NVIDIA physical AI datasets" that individual record pages don't serve well.

DIRECT ANSWER

Each cluster aggregates every record from one author, including records demoted from the indexable surface for thin metadata or duplicate variants. Clusters surface the author’s license posture, total downloads, top modalities, and any arXiv-cited research records. Together the clusters cover 667 records across 38 authors.
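The aggregation described above can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual pipeline: the Record schema, its field names, and the summarize_cluster helper are all assumptions introduced here, and the sample author "example-lab" is hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Record:
    """One robotics-tagged dataset record (hypothetical schema)."""
    author: str
    license: Optional[str]  # None when the upstream card names no license
    downloads: int
    modalities: List[str] = field(default_factory=list)
    arxiv_cited: bool = False


def summarize_cluster(records: List[Record], top_n: int = 3) -> dict:
    """Aggregate all records from a single author into one cluster summary."""
    if not records:
        raise ValueError("a cluster needs at least one record")
    # Count licenses, treating a missing license as "not specified".
    licenses = Counter(r.license or "not specified" for r in records)
    # Count how many records carry each modality tag.
    modalities = Counter(m for r in records for m in r.modalities)
    return {
        "datasets": len(records),
        "downloads": sum(r.downloads for r in records),
        "top_license": licenses.most_common(1)[0][0],
        "top_modalities": [m for m, _ in modalities.most_common(top_n)],
        "arxiv_cited": any(r.arxiv_cited for r in records),
    }


# Hypothetical records for one author, for illustration only.
sample = [
    Record("example-lab", "apache-2.0", 1200, ["Video", "Tabular"], True),
    Record("example-lab", "apache-2.0", 300, ["Video"]),
    Record("example-lab", None, 50, ["Timeseries"]),
]
summary = summarize_cluster(sample)
```

Each card in the list below shows the same fields this summary produces: dataset count, total downloads, top license, leading modalities, and whether any record is arXiv-cited.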

38 AUTHORS

Authors with three or more robotics datasets

30 datasets · 804,037 downloads

IPEC-COMMUNITY

Top license: apache-2.0 · arxiv-cited research

  • Video
  • Tabular
  • Timeseries

6 datasets · 659,048 downloads

cadene

Top license: apache-2.0 · arxiv-cited research

  • Video
  • Tabular
  • Text

13 datasets · 463,897 downloads

nvidia

Top license: cc-by-4.0 · arxiv-cited research

  • Video
  • Tabular
  • Image

10 datasets · 325,043 downloads

InternRobotics

Top license: not specified · arxiv-cited research

  • Text
  • Image
  • 3d

99 datasets · 313,775 downloads

lerobot

Top license: mit · arxiv-cited research

  • Timeseries
  • Tabular
  • Video

326 datasets · 309,368 downloads

RoboCOIN

Top license: apache-2.0 · arxiv-cited research

  • Tabular
  • Timeseries
  • Video

3 datasets · 127,962 downloads

agibot-world

Top license: not specified

  • Text
  • Image

14 datasets · 66,470 downloads

lerobot-data-collection

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

4 datasets · 36,775 downloads

HuggingFaceVLA

Top license: apache-2.0 · arxiv-cited research

  • Video
  • Image
  • Timeseries

37 datasets · 35,113 downloads

unitreerobotics

Top license: apache-2.0 · arxiv-cited research

  • Video
  • Tabular
  • Timeseries

4 datasets · 28,789 downloads

jnogga

Top license: cc-by-4.0

4 datasets · 23,195 downloads

Traly

Top license: apache-2.0 · arxiv-cited research

  • Tabular
  • Timeseries
  • Video

3 datasets · 14,785 downloads

OpenDriveLab-org

Top license: cc-by-nc-sa-4.0 · arxiv-cited research

  • Tabular
  • Timeseries
  • Video

3 datasets · 14,632 downloads

allenai

Top license: cc-by-4.0 · arxiv-cited research

  • Image
  • Text
  • Timeseries

4 datasets · 12,967 downloads

FedorX8

Top license: apache-2.0 · arxiv-cited research

  • Video
  • Tabular
  • Timeseries

3 datasets · 11,205 downloads

leggedrobotics

Top license: mit · arxiv-cited research

  • Image
  • Text

9 datasets · 11,127 downloads

ByteRainx

Top license: apache-2.0

  • Video

3 datasets · 10,231 downloads

fleaven

Top license: cc-by-4.0

3 datasets · 9,175 downloads

spatialverse

Top license: cc-by-nc-4.0 · arxiv-cited research

  • Image
  • 3d

3 datasets · 9,132 downloads

shihao1895

Top license: mit · arxiv-cited research

10 datasets · 8,784 downloads

Factory-Intelligence

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

3 datasets · 8,484 downloads

Joocjun

Top license: apache-2.0

  • Tabular
  • Video

4 datasets · 6,235 downloads

haosulab

Top license: apache-2.0 · arxiv-cited research

4 datasets · 5,769 downloads

ygtxr1997

Top license: apache-2.0

  • Video

3 datasets · 5,156 downloads

open-world-agents

Top license: cc-by-nc-4.0 · arxiv-cited research

  • Video

4 datasets · 4,315 downloads

Gongsta

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

11 datasets · 4,081 downloads

oxe-auge

Top license: cc-by-4.0 · arxiv-cited research

  • Tabular
  • Text
  • Timeseries

7 datasets · 3,143 downloads

villekuosmanen

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

8 datasets · 3,141 downloads

Mimic-Robotics

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

5 datasets · 2,975 downloads

BeingBeyond

Top license: cc-by-4.0 · arxiv-cited research

  • Tabular
  • Timeseries
  • Video

5 datasets · 2,769 downloads

andreaskoepf

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

3 datasets · 2,398 downloads

SUZ-tsinghua

Top license: apache-2.0

  • Image
  • Timeseries

3 datasets · 2,267 downloads

VLA-Arena

Top license: apache-2.0

  • Image
  • Timeseries

4 datasets · 1,943 downloads

DRMNmadhan

Top license: apache-2.0

  • Video
  • Tabular
  • Timeseries

3 datasets · 1,617 downloads

arth-shukla

Top license: mit · arxiv-cited research

3 datasets · 1,542 downloads

TESS-Computer

Top license: mit · arxiv-cited research

  • Text
  • Tabular

3 datasets · 1,240 downloads

autobio-bench

Top license: mit

  • Tabular
  • Timeseries
  • Video

3 datasets · 1,185 downloads

fbeltrao

Top license: apache-2.0

  • Tabular
  • Timeseries
  • Video

RESEARCH PATHS

Use this record as part of a broader dataset review

A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.

For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.

Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.

TRUELABEL ROUTING

Don't see an author you expected?

The watchlist updates from upstream HF metadata. Authors with fewer than three robotics-tagged records appear under their individual dataset pages but don't get their own cluster.
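The routing rule above amounts to a simple threshold on per-author record counts. A minimal sketch, assuming the input is just a list of author names with one entry per robotics-tagged record; the route_authors helper and the sample author names are hypothetical, and only the three-record threshold comes from this page.

```python
from collections import Counter

MIN_CLUSTER_SIZE = 3  # threshold stated on this page


def route_authors(record_authors):
    """Split authors into those that get a cluster and those that do not.

    record_authors: iterable of author names, one entry per record.
    Returns (clustered, individual) dicts mapping author -> record count.
    """
    counts = Counter(record_authors)
    clustered = {a: n for a, n in counts.items() if n >= MIN_CLUSTER_SIZE}
    individual = {a: n for a, n in counts.items() if n < MIN_CLUSTER_SIZE}
    return clustered, individual


# Hypothetical input: "lab-a" has three records, "lab-b" has two.
authors = ["lab-a", "lab-b", "lab-a", "lab-a", "lab-b"]
clustered, individual = route_authors(authors)
```

Under this sketch, an author with two records stays reachable through its individual dataset pages and moves into a cluster once a third record appears upstream.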

See the full Hugging Face watchlist