Entity · product

WidowX

productactiveprovisionalwidowx-35ee195a·1 events·first seen 20h ago

Aliases: WidowX

Co-occurring entities

SIMPLER Inverse Dynamics Task-Agnostic Pretraining (TAP)

More like this (12)

Simpler-WidowX QwenLM Qwen3 Reynold Xin Yuxi Qwen WISE Qwen Code Qwen-VL Qwen Team Qwen Chat Qwen3.6-Plus

Recent events (1)

6arXiv · cs.AI·20h ago·source ↗

Task-Agnostic Pretraining (TAP) decouples motor learning from language grounding in VLA models

Researchers propose Task-Agnostic Pretraining (TAP), a two-stage framework for Vision-Language-Action models that separates physical motor skill acquisition from semantic language alignment. The first stage learns motor priors from cheap unlabeled interaction data via a self-supervised Inverse Dynamics objective; the second stage grounds these priors in language using minimal expert demonstrations. On the SIMPLER benchmark, TAP matches models trained on over 1M expert trajectories while using orders of magnitude less labeled data, and on a real-world WidowX robot retains 25% success under camera perturbations where internet-scale baselines collapse to 0%.

Multimodal Progress SIMPLER Inverse Dynamics WidowX +1 more