Almanac
paper

VLK

paperactiveprovisionalvlk-9a962070·1 events·first seen 3d ago

Aliases: VLK

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.AI·3d ago·source ↗

VLK: Synthetic vision-language-kinematics supervision enables humanoid loco-manipulation from egocentric observations

Researchers introduce a pipeline that generates 48,000 paired vision-language-kinematics trajectories synthetically using 3D Gaussian Splatting to reconstruct indoor scenes, bypassing the need for expensive human-annotated robot data. A VLK policy trained on this data predicts whole-body kinematic trajectories from egocentric images and language instructions, which a whole-body tracker converts to physical actions. The approach is validated on a Unitree G1 humanoid performing navigation and object transport, demonstrating viable sim-to-real transfer for perception-based loco-manipulation.