Almanac
paper

InSight: Self-Guided Skill Acquisition via Steerable VLAs

paperactiveprovisionalinsight-self-guided-skill-acquisition-via-steerable-vlas-4f27eae3·1 events·first seen 18h ago

Aliases: InSight: Self-Guided Skill Acquisition via Steerable VLAs

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.LG·18h ago·source ↗

InSight: Self-guided autonomous skill acquisition for vision-language-action models via primitive steerability

InSight is a framework enabling VLA models to autonomously acquire new manipulation skills beyond their training data by decomposing demonstrations into labeled primitive actions (e.g., 'move gripper to bowl', 'pour the bottle') and running a VLM-guided data flywheel that identifies missing primitives, attempts demonstrations, and integrates successful ones back into training. The system requires no human demonstrations of target skills and is evaluated on simulation and real-world tasks including block flipping, drawer closing, sweeping, and pouring. Learned primitives can be composed for novel long-horizon tasks, offering a practical path toward continual skill acquisition in robotic VLA policies.