FACTR 2: Neural External Torque Estimation enables force-aware robot policy learning without dedicated sensors
Researchers introduce NEXT (Neural External Torque Estimation), a data-driven method that estimates external joint torques on commodity robot arms without dedicated force sensors, training in one minute from ten minutes of free-motion data. Combined with Force-Informed Re-Sampling Training (FIRST), which up-samples contact segments during behavior cloning, the system outperforms prior force-aware policies by over 17% in task progress across five long-horizon manipulation tasks. The work lowers the hardware barrier for contact-rich robot manipulation by bringing force-feedback teleoperation and policy learning to off-the-shelf arms.
Related guides (1)
Related events (8)
Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation (CoP)
Researchers introduce Center-of-Pressure (CoP), a tactile representation grounded in physical principles designed to bridge the sim-to-real gap in contact-rich dexterous manipulation. CoP preserves dense contact information while remaining robust for sim-to-real transfer, supported by a differentiable-dynamics-based sensor calibration scheme that estimates taxel orientations without ground-truth force measurements. Evaluated on peg-in-hole insertion and ball balancing tasks, CoP-conditioned policies achieve zero-shot sim-to-real transfer on a multi-fingered robotic hand, outperforming binary-contact and raw-taxel baselines. An emergent finding is that CoP-conditioned policies implicitly encode task-relevant physical properties such as object mass.
Mana framework achieves zero-shot sim-to-real transfer for dexterous articulated tool manipulation
Researchers introduce Mana (Manipulation Animator), a sim-to-real framework that reframes dexterous robotic manipulation as an animation problem using a coarse-to-fine pipeline of procedurally-generated grasp keyframes, motion planning, and reinforcement learning. The system requires minimal human input (under one minute per tool) and achieves zero-shot sim-to-real transfer across four articulated tools with varying joint types and scales. The work addresses a longstanding gap in dexterous robotics where articulated tool use—requiring coordination of internal degrees of freedom and contact-rich interactions—has been underexplored relative to rigid object manipulation.
Sim-to-real transfer of robotic control with dynamics randomization
OpenAI published research on transferring robotic control policies trained in simulation to real-world robots using dynamics randomization. The technique involves varying physical parameters during simulation training so that the real world appears as just another variation, enabling zero-shot sim-to-real transfer. This was an early foundational contribution to the sim-to-real robotics research thread.
DynaFLIP: Dynamics-Aware Multimodal Pre-Training for Robot Manipulation Perception
DynaFLIP is a pre-training framework that injects motion understanding into visual encoders for robot manipulation by constructing image-language-3D flow triplets from human and robot videos. The method encourages tri-modal alignment via simplex-volume minimization in a shared hyperspherical space, combined with cosine regularization and contrastive objectives. The resulting dynamics-aware visual backbone consistently outperforms baselines across diverse downstream policies including VLAs, with gains up to +22.5% in out-of-distribution scenarios. The work argues that robot generalization requires encoding how the world changes under action, not just static scene content.
Asynchronous Robot Inference: Decoupling Action Prediction and Execution
Hugging Face published a blog post on asynchronous robot inference, a technique that decouples the timing of action prediction from action execution in robotic systems. This approach addresses latency bottlenecks that arise when large neural network inference times exceed the real-time control loop requirements of physical robots. The post likely covers architectural patterns and implementation strategies for deploying vision-language-action models or similar policies on robot hardware without blocking the control pipeline.
Generalizing from Simulation: OpenAI Sim-to-Real Robotics Transfer
OpenAI published results on sim-to-real transfer for robot controllers, demonstrating that policies trained entirely in simulation can be deployed on physical robots and respond to unplanned environmental changes. The work represents a shift from open-loop to closed-loop control systems in robotics. This is a 2017 research milestone predating current frontier model work but relevant to the historical trajectory of OpenAI's robotics program.
DARP: Semi-parametric retrieval-based imitation learning reduces compounding errors by 15-46%
Researchers introduce DARP (Difference-Aware Retrieval Policies), a semi-parametric imitation learning method that retrieves k-nearest neighbor demonstrations at inference time and predicts actions based on relative distance vectors between neighbor and query states. The approach reparameterizes behavior cloning around local neighborhood structure rather than global state-to-action mappings, requiring no additional data collection or online expert feedback. Across continuous control and robotic manipulation tasks, DARP shows 15-46% performance improvements over standard behavior cloning, including on high-dimensional visual inputs.
Ingredients for robotics research
OpenAI released eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay (HER), developed over the prior year for internal research. These environments were used to train models that transfer to physical robots. The release also included a set of research requests to guide community contributions in robotics.
