Almanac
paper

Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy

paperactiveprovisionaladvancing-omnimodal-embodied-agents-from-isolated-skills-to-everyday-physical-autonomy-98d2d96f·1 events·first seen 3d ago

Aliases: Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.AI·3d ago·source ↗

OmniAct: Hierarchical framework for persistent embodied agents with unified cyber-physical action spaces

Researchers present OmniAct, a hierarchical asynchronous architecture for embodied agents that unifies cyber (APIs, IoT) and physical (manipulation, navigation) action spaces under a single multimodal semantic planner. The system incorporates adaptive hierarchical memory with event-boundary-driven compression to maintain sub-linear context growth, and an asynchronous visual preemption engine for closed-loop failure recovery during physical execution. Evaluated across 40 real-world long-horizon tasks on two robotic platforms coordinating four IoT devices, OmniAct achieves consistent end-to-end success improvements and elevates mid-scale open-weight models to proprietary-level performance. The work directly addresses the fragmentation between planning, memory, and verification in existing embodied agent systems.