HiPHI-MOV
HiPHI (High Precision Human Interaction) Motion With Object & Vision



The HiPHI-MOV Dataset is a human-centric, high-fidelity multimodal corpus specifically engineered for the development of robust locomotion and whole-body loco-manipulation policies. It includes full-body motion capture, tracking of interacting objects, egocentric RGB-D visual data, third-person RGB-D visual data. HiPHI-MOV provides a synchronized data stream that co-registers ground-truth, full-body kinematic trajectories—captured via high-frequency motion capture—with ego-centric and exo-centric visual observations. This structured hierarchy enables the modeling of complex robotic behaviors, ranging from low-level motor primitives (joint-space dynamics) to high-level environmental affordances (scene-contextual navigation).
Acquisition devices
- Optical Motion Capture System
- Optical Object Tracking System
- High-Precision Tracking Camera
Modalities & precision
High precision- Hand: BVH FilesSub-mm
- Body: BVH FilesSub-mm
- Objects: 3D Mesh
Annual Data Production Capacity
Sample Data
Loading samples…
To view more of our datasets, please log in or register.
Dataset attributes & data distribution
Action distribution
Semantic Universe
Explore HiPHI-MOV motion families, semantic frames, and FrameNet-style lexical units across the script library.
Motion Quality
HiPHI-MOV traces set the clean artifact profile across motion sources.
Cross-dataset comparison
t-SNE visualization of motion samples from AMASS, LAFAN1, PHUMA, and HiPHI-MOV in a shared feature space. HiPHI-MOV occupies a broad portion of the embedding space and shows substantial overlap with the existing datasets, suggesting wide motion coverage and partial commonality in motion patterns, while still exhibiting distinctive local structures.
Want to partner with us?
We collaborate with teams pushing the edge of embodied AI.
To explore our ModalityNet datasets please register or login.