
Jiaheng Hu

person · active · jiaheng-hu-486310a8 · 1 event · first seen May 18, 2026

Aliases: Jiaheng Hu

Co-occurring entities

Elastic Weight Consolidation, Dark Experience Replay, University of California Los Angeles, LIBERO, catastrophic forgetting, GRPO, LoRA, Sony, Nanyang Technological University, Jay Shim, OpenVLA-OFT, University of Texas Austin

More like this (12)

Hanjiang Hu, Jiasen Lu, Jiazheng Xing, Wei Zhepei, Huawei, David Chen, Jiyuan Tan, Hanlin Zhu, DayuanJiang, Sizhe Chen, Jocelyn Shen, Stephanie Lin

Recent events (1)

5 · The Batch · May 18, 2026

Sony and University Researchers Train Robots To Learn Without Catastrophic Forgetting

Researchers from UT Austin, UCLA, Nanyang Technological University, and Sony developed a sequential fine-tuning recipe that combines LoRA with on-policy reinforcement learning (GRPO) to reduce catastrophic forgetting in vision-language-action (VLA) models for robotics. Applied to the OpenVLA-OFT model on the LIBERO benchmark, the method achieved 81.2% success on libero-spatial tasks with near-zero forgetting (a 0.3-percentage-point drop), outperforming established continual learning baselines including Dark Experience Replay and Elastic Weight Consolidation. The approach requires no replay of prior task data and also showed modest generalization to unseen tasks. The authors note that the method has not yet been tested outside robotics simulation.
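To make the "percentage point drop" figure concrete, here is a minimal sketch of one common way to measure forgetting: compare success on an earlier task right after it was learned with success on the same task after later tasks were trained. The function names, task names, and all numbers are hypothetical and for illustration only; the summary does not say how the authors computed their figure.

```python
from typing import Dict


def forgetting_pp(success_after_learning: float, success_after_later_tasks: float) -> float:
    """Drop in success rate on one earlier task, in percentage points.

    Both inputs are success rates in [0, 1]. A positive result means the
    model got worse on the earlier task (forgetting); a negative result
    means it improved.
    """
    return (success_after_learning - success_after_later_tasks) * 100.0


def mean_forgetting_pp(before: Dict[str, float], after: Dict[str, float]) -> float:
    """Average forgetting over all tasks present in `before`."""
    drops = [forgetting_pp(before[task], after[task]) for task in before]
    return sum(drops) / len(drops)


# Hypothetical success rates, not taken from the paper or the summary.
before = {"task_a": 0.80, "task_b": 0.90}
after = {"task_a": 0.75, "task_b": 0.89}

print(f"mean forgetting: {mean_forgetting_pp(before, after):.1f} pp")
```

A percentage-point drop is an absolute difference between two rates, so a fall from 80% to 75% is 5 points, not 5% of the original rate.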

Evaluation and Benchmarking, Agent and Tool Ecosystem, Elastic Weight Consolidation, Dark Experience Replay, University of California Los Angeles