Entity · benchmark

Sokoban

benchmarkactivesokoban-5ddd673a·1 events·first seen Jun 15, 2026

Aliases: Sokoban

Co-occurring entities

ALFWorld RePro Qwen WebShop

More like this (12)

Sudoku-Extreme text games Box Matrix-Game PAC-MAN Megablocks AgentBoard SkillGate Slay the Spire 2 BGE MiniMind Jukebox

Recent events (1)

5arXiv · cs.CL·Jun 15, 2026·source ↗

RePro: Retrospective Progress-Aware Self-Refinement for LLM Agent Training

Researchers introduce RePro (Retrospective Progress-Aware Training), a framework addressing the gap between step-wise RL optimization and metacognitive task-progress awareness in LLM agents. The approach uses a forward-then-reflect rollout paradigm where agents execute actions online and then retrospectively assess step-wise progress given the completed trajectory and known outcome. Evaluated on WebShop, ALFWorld, and Sokoban, RePro achieves up to 12% absolute success rate gains over baseline Qwen-family models without requiring continuous external supervision.

Agent and Tool Ecosystem Alignment and RLHF ALFWorld Sokoban RePro +2 more