HF AUTHOR CLUSTER

open-world-agents robotics datasets

open-world-agents publishes 3 robotics-tagged Hugging Face records, totaling 5,156 cumulative downloads. Two of them (D2E-480p and D2E-Original) are marked as having a published arXiv paper.

DIRECT ANSWER

Author clusters consolidate every record from one publisher into a single buyer-review surface. open-world-agents ships 3 robotics datasets on Hugging Face. Top license: cc-by-nc-4.0 (2 of 3 records; vpt-owamcap is apache-2.0). Of those 3 records, 2 get a full standalone page, 0 get a shorter profile, and 1 is folded into this cluster.
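The cluster-level figures can be re-derived from the per-record listings on this page. The following is a minimal sketch, assuming the three records are represented as plain dictionaries; the `records` list and the `summarize` helper are illustrative names, not part of any Hugging Face API.

```python
from collections import Counter

# Values copied from the three record listings on this page.
records = [
    {"name": "D2E-480p", "license": "cc-by-nc-4.0", "downloads": 2236},
    {"name": "D2E-Original", "license": "cc-by-nc-4.0", "downloads": 2230},
    {"name": "vpt-owamcap", "license": "apache-2.0", "downloads": 690},
]


def summarize(records):
    """Return (record count, cumulative downloads, most common license)."""
    total = sum(r["downloads"] for r in records)
    top_license, _ = Counter(r["license"] for r in records).most_common(1)[0]
    return len(records), total, top_license


count, total, top_license = summarize(records)
print(count, total, top_license)
```

Note that "top license" here means the most frequent license across records, so a single apache-2.0 record does not change the headline value.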

Robotics-tagged records: 3

Cumulative downloads (Hub signal): 5,156

Top license (first-pass rights): cc-by-nc-4.0

3 of 3 datasets

D2E-480p

Published Apr 2026 · cc-by-nc-4.0 · open-world-agents

This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.

2,236 downloads
1 like
n<1K
Paper available

Video

D2E-Original

Published Apr 2026 · cc-by-nc-4.0 · open-world-agents

This is the dataset for D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI.

2,230 downloads
2 likes
n<1K
Paper available

Video

vpt-owamcap

Published Jul 2025 · apache-2.0 · open-world-agents

This dataset is an OWAMcap conversion of the Video PreTraining (VPT) Minecraft dataset. It is packaged in the WebDataset format, which is essentially a series of compressed tar files.

690 downloads
6 likes
10K<n<100K
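Because the vpt-owamcap description says the data is a series of tar files, a reviewer can inspect a shard with nothing but the Python standard library. The following is a minimal sketch, not the publisher's loader: it builds a tiny in-memory shard and groups members into samples by the part of the name before the first dot, which is the usual WebDataset convention. The file names, the `.mcap`/`.mkv` extensions, and the gzip compression are assumptions for illustration; the real shard layout should be checked against the dataset card.

```python
import io
import tarfile
from collections import defaultdict


def make_demo_shard():
    """Build a tiny in-memory gzip tar shard with hypothetical file names."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w:gz") as tar:
        for name in ["ep0001.mcap", "ep0001.mkv", "ep0002.mcap", "ep0002.mkv"]:
            payload = b"placeholder"  # stand-in bytes, not real recordings
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    buf.seek(0)
    return buf


def group_samples(fileobj):
    """Group tar members into samples keyed by the name before the first dot."""
    samples = defaultdict(dict)
    # "r:*" lets tarfile detect the compression (none, gzip, bz2, xz).
    with tarfile.open(fileobj=fileobj, mode="r:*") as tar:
        for member in tar:
            if not member.isfile():
                continue
            key, _, ext = member.name.partition(".")
            samples[key][ext] = tar.extractfile(member).read()
    return dict(samples)


samples = group_samples(make_demo_shard())
print(sorted(samples))
```

For a real shard, pass an opened file object (for example `open(path, "rb")`) to `group_samples` instead of the demo buffer. Reading every payload into memory is only sensible for small shards; for large video files, record `member.size` instead.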

A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.

For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.

Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.
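The routing advice above can be written down as a small decision function. This is a sketch under stated assumptions: the function name, the step labels, and the rule that a listed blocker takes precedence over the other conditions are illustrative choices, not something the page specifies.

```python
# Blockers the text names as reasons to switch to custom collection.
CUSTOM_COLLECTION_BLOCKERS = {
    "rights",
    "consent",
    "geography",
    "robot embodiment",
    "target environment coverage",
}


def next_review_step(discovery_complete, candidate_known, blockers=()):
    """Pick the next review step; the labels are illustrative, not site terms."""
    # Assumption: any listed blocker overrides the other conditions.
    if CUSTOM_COLLECTION_BLOCKERS & set(blockers):
        return "custom collection"
    if discovery_complete and candidate_known:
        return "profile and comparison pages"
    # Discovery is incomplete or no candidate has been chosen yet.
    return "broad discovery"


print(next_review_step(False, False))
print(next_review_step(True, True))
print(next_review_step(True, True, blockers=["rights"]))
```

Encoding the blockers as a set keeps the check explicit, so a team can add its own blocker categories without touching the control flow.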

TRUELABEL ROUTING

Need data like open-world-agents ships, but with cleaner rights?

If the Hub records don't carry the license, consent, or deployment fit your team needs, commission a custom collection on the same modality with explicit commercial terms.

