5arXiv cs.LG (Machine Learning)·10d ago

FACTR 2: Neural External Torque Estimation enables force-aware robot policy learning without dedicated sensors

Researchers introduce NEXT (Neural External Torque Estimation), a data-driven method that estimates external joint torques on commodity robot arms without dedicated force sensors, training in one minute from ten minutes of free-motion data. Combined with Force-Informed Re-Sampling Training (FIRST), which up-samples contact segments during behavior cloning, the system outperforms prior force-aware policies by over 17% in task progress across five long-horizon manipulation tasks. The work lowers the hardware barrier for contact-rich robot manipulation by bringing force-feedback teleoperation and policy learning to off-the-shelf arms.

Agent and Tool Ecosystem Neural External Torque Estimation Force-Informed Re-Sampling Training FACTR 2

Related guides (1)

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Related events (8)

5arXiv · cs.AI·24d ago·source ↗

Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation (CoP)

Researchers introduce Center-of-Pressure (CoP), a tactile representation grounded in physical principles designed to bridge the sim-to-real gap in contact-rich dexterous manipulation. CoP preserves dense contact information while remaining robust for sim-to-real transfer, supported by a differentiable-dynamics-based sensor calibration scheme that estimates taxel orientations without ground-truth force measurements. Evaluated on peg-in-hole insertion and ball balancing tasks, CoP-conditioned policies achieve zero-shot sim-to-real transfer on a multi-fingered robotic hand, outperforming binary-contact and raw-taxel baselines. An emergent finding is that CoP-conditioned policies implicitly encode task-relevant physical properties such as object mass.

Evaluation and Benchmarking Agent and Tool Ecosystem multi-fingered dexterous hand Center-of-Pressure (CoP) tactile representation ball balancing +5 more

5arXiv · cs.LG·9d ago·source ↗

Mana framework achieves zero-shot sim-to-real transfer for dexterous articulated tool manipulation

Researchers introduce Mana (Manipulation Animator), a sim-to-real framework that reframes dexterous robotic manipulation as an animation problem using a coarse-to-fine pipeline of procedurally-generated grasp keyframes, motion planning, and reinforcement learning. The system requires minimal human input (under one minute per tool) and achieves zero-shot sim-to-real transfer across four articulated tools with varying joint types and scales. The work addresses a longstanding gap in dexterous robotics where articulated tool use—requiring coordination of internal degrees of freedom and contact-rich interactions—has been underexplored relative to rigid object manipulation.

Agent and Tool Ecosystem Mana Mana: Dexterous Manipulation of Articulated Tools

5Openai Blog·1mo ago·source ↗

Sim-to-real transfer of robotic control with dynamics randomization

OpenAI published research on transferring robotic control policies trained in simulation to real-world robots using dynamics randomization. The technique involves varying physical parameters during simulation training so that the real world appears as just another variation, enabling zero-shot sim-to-real transfer. This was an early foundational contribution to the sim-to-real robotics research thread.

Agent and Tool Ecosystem dynamics randomization sim-to-real transfer OpenAI

6arXiv · cs.LG·23d ago·source ↗

DynaFLIP: Dynamics-Aware Multimodal Pre-Training for Robot Manipulation Perception

DynaFLIP is a pre-training framework that injects motion understanding into visual encoders for robot manipulation by constructing image-language-3D flow triplets from human and robot videos. The method encourages tri-modal alignment via simplex-volume minimization in a shared hyperspherical space, combined with cosine regularization and contrastive objectives. The resulting dynamics-aware visual backbone consistently outperforms baselines across diverse downstream policies including VLAs, with gains up to +22.5% in out-of-distribution scenarios. The work argues that robot generalization requires encoding how the world changes under action, not just static scene content.

Frontier Model Releases Agent and Tool Ecosystem Vision-Language-Action models simplex-volume minimization DynaFLIP +3 more

5Hugging Face Blog·1mo ago·source ↗

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

Hugging Face published a blog post on asynchronous robot inference, a technique that decouples the timing of action prediction from action execution in robotic systems. This approach addresses latency bottlenecks that arise when large neural network inference times exceed the real-time control loop requirements of physical robots. The post likely covers architectural patterns and implementation strategies for deploying vision-language-action models or similar policies on robot hardware without blocking the control pipeline.

Inference Economics Enterprise Deployment Patterns asynchronous inference Vision-Language-Action model Hugging Face +1 more

4Openai Blog·1mo ago·source ↗

Generalizing from Simulation: OpenAI Sim-to-Real Robotics Transfer

OpenAI published results on sim-to-real transfer for robot controllers, demonstrating that policies trained entirely in simulation can be deployed on physical robots and respond to unplanned environmental changes. The work represents a shift from open-loop to closed-loop control systems in robotics. This is a 2017 research milestone predating current frontier model work but relevant to the historical trajectory of OpenAI's robotics program.

Agent and Tool Ecosystem sim-to-real transfer closed-loop control OpenAI

5arXiv · cs.AI·12d ago·source ↗

DARP: Semi-parametric retrieval-based imitation learning reduces compounding errors by 15-46%

Researchers introduce DARP (Difference-Aware Retrieval Policies), a semi-parametric imitation learning method that retrieves k-nearest neighbor demonstrations at inference time and predicts actions based on relative distance vectors between neighbor and query states. The approach reparameterizes behavior cloning around local neighborhood structure rather than global state-to-action mappings, requiring no additional data collection or online expert feedback. Across continuous control and robotic manipulation tasks, DARP shows 15-46% performance improvements over standard behavior cloning, including on high-dimensional visual inputs.

Agent and Tool Ecosystem DARP Difference-Aware Retrieval Policies for Imitation Learning

4Openai Blog·1mo ago·source ↗

Ingredients for robotics research

OpenAI released eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay (HER), developed over the prior year for internal research. These environments were used to train models that transfer to physical robots. The release also included a set of research requests to guide community contributions in robotics.

Agent and Tool Ecosystem Hindsight Experience Replay OpenAI Baselines OpenAI