paper
VLK
paperactiveprovisional
vlk-9a962070·1 events·first seen 3d agoAliases: VLK
Co-occurring entities
More like this (12)
Recent events (1)
VLK: Synthetic vision-language-kinematics supervision enables humanoid loco-manipulation from egocentric observations
Researchers introduce a pipeline that generates 48,000 paired vision-language-kinematics trajectories synthetically using 3D Gaussian Splatting to reconstruct indoor scenes, bypassing the need for expensive human-annotated robot data. A VLK policy trained on this data predicts whole-body kinematic trajectories from egocentric images and language instructions, which a whole-body tracker converts to physical actions. The approach is validated on a Unitree G1 humanoid performing navigation and object transport, demonstrating viable sim-to-real transfer for perception-based loco-manipulation.