benchmark
HeraBench
benchmarkactiveprovisional
herabench-ecb44a2a·1 events·first seen 47h agoAliases: HeraBench
Co-occurring entities
More like this (12)
Recent events (1)
H-RePlan: Hierarchical recovery framework for multi-device computer-use agents
Researchers introduce H-RePlan, a hierarchical replanning framework for agents operating across multiple devices (Linux and Android) with unified API-CLI-GUI execution. The system separates device-local strategy recovery from orchestrator-level global replanning via a cross-layer failure abstraction, enabling finer-grained fault handling than existing retry or reassignment approaches. A companion benchmark, HeraBench, injects strategy- and device-level failures into cross-device workflows to evaluate recovery capability. Experiments show H-RePlan outperforms single-strategy and coarse-grained baselines on completion, instruction adherence, and token efficiency.