Records
30
HF AUTHOR CLUSTER
30 robotics-tagged HF records from IPEC-COMMUNITY, totaling 804,037 cumulative downloads. Some records cite published arxiv research.
DIRECT ANSWER
Author clusters consolidate every record from one publisher into a single buyer-review surface. IPEC-COMMUNITY ships 30 robotics datasets on Hugging Face. Top license: apache-2.0. Tier breakdown: 30 indexable as Tier A, 0 as Tier B, 0 demoted (those URLs redirect here).
30
804,037
apache-2.0
DATASETS
30 of 30 datasets
299,775 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 92233, "total_frames": 27044326, "total_tasks": 31308,
165,979 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 442226, "total_frames": 7045476, "total_tasks": 127605,
151,758 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "widowx", "total_episodes": 53192, "total_frames": 1893026, "total_tasks": 19974,
96,222 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 87212, "total_frames": 3786400, "total_tasks": 5
46,810 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "kuka_iiwa", "total_episodes": 209880, "total_frames": 2455879, "total_tasks": 1,
6,440 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 8612, "total_frames": 1137459, "total_tasks": 24, "tot
4,302 downloads · apache-2.0
🤖 EO-Data-1.5M A Large-Scale Interleaved Vision-Text-Action Dataset for Embodied AI The first large-scale interleaved embodied dataset emphasizing temporal dynamics and causal dependencies among visi
3,630 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 3921, "total_frames": 569249, "total_tasks": 73, "tota
3,535 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "google_robot", "total_episodes": 39350, "total_frames": 5471693, "total_tasks": 1
2,692 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 5100, "total_frames": 3948057, "total_tasks": 9, "tota
2,377 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 3242, "total_frames": 213972, "total_tasks": 403, "tot
2,275 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 5208, "total_frames": 1139911, "total_tasks": 7
2,118 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "ur5", "total_episodes": 896, "total_frames": 87783, "total_tasks": 5, "total_vide
1,747 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "sawyer", "total_episodes": 1796, "total_frames": 168423, "total_tasks": 3, "total
1,489 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1500, "total_frames": 361883, "total_tasks": 50, "tota
1,335 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "jaco_2", "total_episodes": 976, "total_frames": 70127, "total_tasks": 88, "total_
1,263 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 902, "total_frames": 294139, "total_tasks": 1, "total_
1,192 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 631, "total_frames": 146241, "total_tasks": 7, "total_
1,046 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 1482, "total_frames": 38240, "total_tasks": 1, "total_
1,040 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 559, "total_frames": 279939, "total_tasks": 2, "total_
1,028 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 570, "total_frames": 358234, "total_tasks": 3, "total_
980 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 576, "total_frames": 235922, "total_tasks": 44, "total
913 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 908, "total_frames": 392578, "total_tasks": 4, "total_
803 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 240, "total_frames": 353094, "total_tasks": 4, "total_
763 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 135, "total_frames": 68913, "total_tasks": 3, "total_v
622 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "franka", "total_episodes": 365, "total_frames": 34448, "total_tasks": 1, "total_v
531 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "xarm", "total_episodes": 480, "total_frames": 45308, "total_tasks": 6, "total_vid
528 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 435, "total_frames": 18196, "total_tasks": 1, "
474 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "fanuc_mate", "total_episodes": 415, "total_frames": 62613, "total_tasks": 32, "to
370 downloads · apache-2.0
This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v2.0", "robot_type": "hello_stretch", "total_episodes": 135, "total_frames": 25016, "total_tasks": 5, "
RESEARCH PATHS
A dataset record is only useful when it connects into the rest of the buyer workflow. The next review step is usually not another summary; it is a fit check, rights triage, source comparison, or custom bounty spec that names the missing proof.
For physical AI teams, the hard question is whether the public source can support a specific model objective under real deployment constraints. That requires adjacent dataset records, tools, comparisons, and sourcing paths, plus external references that a reviewer can open and challenge.
Use the links below to keep the review grounded. Start broad when discovery is incomplete, move into profile and comparison pages when the candidate source is known, and switch to custom collection when the blocker is rights, consent, geography, robot embodiment, or target environment coverage.
INTERNAL LINKS
Use the catalog to compare source-backed dataset profiles by modality, task, rights signal, consent risk, and deployment fit.
Scan the broader robotics dataset surface before narrowing into promoted profiles, comparisons, and custom collection specs.
Track source updates, licensing notes, and buyer-readiness changes that should trigger a renewed review.
Score whether a public source is enough for the model, rights path, modalities, and target environment.
Separate source license language from contributor consent, redistribution, private-space risk, and model-use assumptions.
Turn a public-source gap into a scoped capture request with sample QA, metadata, and delivery requirements.
Compare data providers when the answer is not another public dataset but a better sourcing or capture route.
Use the company index to separate annotation vendors, data engines, marketplaces, and specialist capture teams.
EXTERNAL REFERENCES
Market context for why physical AI systems need custom, enriched, real-world data beyond generic labeling workflows.
Robotics dataset and tooling context for Hugging Face based collection, sharing, conversion, and training workflows.
A cross-embodiment robotics dataset reference for comparing trajectory scale, robot diversity, and VLA training assumptions.
A large in-the-wild robot manipulation dataset reference for real-world trajectory capture and deployment transfer risk.
TRUELABEL ROUTING
If the Hub records don't carry the license, consent, or deployment fit your team needs, commission a custom collection on the same modality with explicit commercial terms.