Entity · benchmark

R2R

benchmarkactiver2r-6d9f389a·1 events·first seen May 29, 2026

Aliases: R2R

Co-occurring entities

Qwen-VLA DOMINO LIBERO Simpler-WidowX RoboTwin ALOHA Linear Diffusion Transformer RxR embodiment-aware prompt conditioning Alibaba Qwen Team

More like this (12)

R2R-CE R2 Indicator R^3 RL² Open R1 Double Ratchet πR² R-Drop G-RRM RMM RL Conductor GR2

Recent events (1)

7arXiv · cs.CL·May 29, 2026·source ↗

Qwen-VLA: Unified Vision-Language-Action Model Across Robot Tasks, Environments, and Embodiments

Alibaba's Qwen team presents Qwen-VLA, a unified embodied foundation model that extends the Qwen vision-language stack to continuous action and trajectory generation via a DiT-based action decoder. The model is jointly pretrained on diverse data spanning manipulation trajectories, egocentric demonstrations, synthetic simulation, and navigation data, with embodiment-aware prompt conditioning to support multiple robot platforms. A unified action-and-trajectory prediction framework covers manipulation, navigation, and trajectory prediction tasks. Benchmarks show strong results: 97.9% on LIBERO, 73.7% on Simpler-WidowX, 69.0% OSR on R2R navigation, and 76.9% average OOD success in real-world ALOHA experiments.

Frontier Model Releases Evaluation and Benchmarking Qwen-VLA DOMINO R2R +10 more