We are building the largest human centric omni-modal datasetfor embodied ai

Available categories include Motion-with-vision, In-the-factory, In-the-wild and pure Motion Capture

Over 20,000 hours of omi-modal, time synced human-centric data with videoAudioPressureOver 20,000 hours of omi-modal, time synced human-centric data with videoAudioPressureOver 20,000 hours of omi-modal, time synced human-centric data with videoAudioPressureOver 20,000 hours of omi-modal, time synced human-centric data with videoAudioPressure

25k+Systems Shipped

120HzTemporal Resolution

4Synchronized Modalities

EnterpriseLicensing Ready

Multimodal Data

Everything you see
is time-aligned.

Licensable, synchronized multi-modal datasets for humanoid robotics. Let engineers preview, verify quality, and license precisely what they need — scene by scene, not whole archives.

Learn More

Sync Accuracy

<1ms

Why Multi-Modal

Finger-level truth (not approximations)

Millimeter-scale hand/finger kinematics plus pressure/contact signals, so models can learn real grasp dynamics—not just pose trajectories.

Synchronized multi-modal ground truth

Motion + multi-view video (and additional signals where applicable) captured in time alignment, enabling strong visual grounding and cross-modal learning.

Coverage across the full realism spectrum

Controlled “factory-grade” precision, large-space motion-with-vision, and truly natural “in-the-wild” behavior—so training data spans clean labels and messy real-world variance.

Built for scale, consistency, and deployment

Repeatable acquisition pipelines, standardized calibration/QA, and dataset structure designed for model training workflows—so you get reliable data, not one-off demos.

Category Overview

Browse All →

MoCap

Motion Capture

120Hz full-body skeletal data with optical-inertial hybrid rigs. Sub-millisecond synchronization across all sensors.

2,400+sequences

ITF

In-The-Field

Unstructured real-world environments: retail, warehouse, kitchen, outdoor. Portable rig captures for domain diversity.

1,800+sequences

ITW

In-The-Wild

Consumer-grade capture for scale and behavioral diversity. Covers rare interaction primitives across demographics.

5,600+sequences

MWV

Multi-View

Synchronized multi-camera arrays with calibrated extrinsics. Enables 3D scene reconstruction and ego/third-person pairing.

900+sequences

Best Sellers

NEW

Bottle handling 0001

$30.00MoCap · 48 seq

NEW

Bottle handling 0002

FreeMoCap · 32 seq

NEW

Bottle handling 0003

Preview Only

PreviewITF · 56 seq

NEW

Bottle handling 0004

OwnedMoCap · 41 seq

NEW

Bottle handling 0005

$30.00MWV · 62 seq

HOT

Walking gait 0001

$45.00MoCap · 120 seq

FEATURED

Object grasp 0001

$60.00ITW · 88 seq

HOT

Bimanual assembly 0001

$55.00MoCap · 74 seq

Trusted By Pioneers in Robotics and AI

From academic institutions to global fortune 500 companies
our data and acquisition pipelines support the current and future development
of humanoid robotics embodied ai.