6The Batch (DeepLearning.AI)·28d ago

Hermes Agent Surpasses OpenClaw in Daily Token Usage, Highlighting Self-Improving Agentic Capabilities

Hermes Agent, an open-source personal agent from Nous Research launched in February 2026, has overtaken OpenClaw on OpenRouter's daily token consumption leaderboard. It distinguishes itself through automatic skill creation (converting successful task completions into reusable SKILL.md instruction files), a two-tier memory architecture with intelligent deduplication and merging, and a Curator background process that manages skill lifecycle. The agent supports local or cloud deployment, integrates with ~20 messaging services, and works with a wide variety of LLMs, positioning it as a model-agnostic alternative in the emerging personal agent category.

Long Context Evolution Open Weights Progress Agent and Tool Ecosystem Honcho OpenClaw Agent Communication Protocol Anthropic Claude Code Curator Hermes Agent OpenRouter OpenAI OpenAI Codex SKILL.md Nous Research Anthropic

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Long Context EvolutionTopic guide

Long Context Evolution: From Bigger Windows to Smarter Memory

Read asBeginner In-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner

Related events (8)

5The Batch·28d ago·source ↗

Hermes Agent Challenges OpenClaw on Token Usage Leaderboard; Agent Self-Improvement Highlighted

Hermes Agent, an open-source AI agent from Nous Research launched in February 2026, has surpassed OpenClaw on OpenRouter's daily token consumption leaderboard. Hermes Agent differentiates itself through a memory architecture and automatic skill-building capability using the SKILL.md format, enabling self-improvement as a core agentic feature. It supports local and cloud deployment, integrates with ~20 messaging services, and works with a wide variety of LLMs via the Agent Communication Protocol. The piece also covers Andrew Ng's commentary on Harvard's grade-capping policy, which is tangential to AI/ML.

Open Weights Progress Agent and Tool Ecosystem DeepLearning.AI OpenClaw Agent Communication Protocol +5 more

4Github Trending·16d ago·source ↗

Hermes Desktop: open-source desktop companion for Hermes Agent gains traction on GitHub

Hermes Desktop is a TypeScript-based open-source desktop application serving as a companion interface for the Hermes Agent. The repository has accumulated over 10,000 stars with 417 added in a single day, suggesting significant community interest. The project appears to be a local/desktop agent harness in the growing ecosystem of open-source AI agent tooling.

Agent and Tool Ecosystem Hermes Desktop fathah

5Github Trending·1mo ago·source ↗

NousResearch/hermes-agent: Trending Python Agent Framework

NousResearch has published hermes-agent, a Python-based agent framework described as 'the agent that grows with you.' The repository is trending heavily on GitHub with 161,088 total stars and 1,923 stars added today, indicating significant community interest. The project appears to be an agentic system built around NousResearch's Hermes model line.

Open Weights Progress Agent and Tool Ecosystem Hermes Hermes Agent NousResearch

8The Batch·8d ago·source ↗

Anthropic launches Claude Mythos 5 and Claude Fable 5; Andrew Ng introduces OpenCoworker desktop agent

Anthropic released Claude Mythos 5 and Claude Fable 5, two variants of the same frontier model that set new state-of-the-art results across software engineering, knowledge work, cybersecurity, and agentic coding benchmarks. Claude Fable 5 is the general-availability version with safety classifiers that restrict responses on security, biology, chemistry, and cutting-edge AI topics, priced at $10/$50 per million input/output tokens; Mythos 5 is restricted to selected partners via Project Glasswing. Separately, Andrew Ng and collaborators released OpenCoworker, a free open-source desktop agent harness built on top of aisuite, designed to give users privacy-preserving agentic workflows with their own API keys or local models. The newsletter also contextualizes the broader shift toward LLM-driven agent harnesses as frontier models have become capable enough to reliably drive next-action decisions.

Frontier Model Releases AI Safety Research Ollama DeepLearning.AI Claude Mythos +13 more

3Github Trending·1mo ago·source ↗

NousResearch Hermes Paperclip Adapter: Run Hermes Agent as a Managed Employee

A TypeScript adapter enabling NousResearch's Hermes agent to operate within the Paperclip company management framework, effectively treating the AI agent as a managed employee. The project has accumulated 1,311 GitHub stars with 37 added today, suggesting moderate community interest. This represents a tooling integration in the agent-as-worker paradigm, connecting an open-weights model ecosystem to an enterprise-style agent orchestration layer.

Open Weights Progress Agent and Tool Ecosystem hermes-paperclip-adapter Hermes Paperclip +1 more

9Anthropic News·19d ago·source ↗

Anthropic Releases Claude Opus 4.5 with State-of-the-Art Coding, Agent, and Computer Use Capabilities

Anthropic has released Claude Opus 4.5, positioning it as the best model in the world for coding, agentic workflows, and computer use, with pricing reduced to $5/$25 per million input/output tokens. The model demonstrates significant token efficiency gains—up to 65% fewer tokens than prior models on equivalent tasks—alongside improvements in long-horizon autonomous task execution, multi-step reasoning, and self-improving agent behavior. The release is accompanied by updates to Claude Code, the Claude Developer Platform, and integrations with Excel, Chrome, and desktop environments. Early partner feedback from GitHub Copilot, Cursor, Notion, Warp, and others reports measurable benchmark improvements and new use cases previously out of reach.

Frontier Model Releases Evaluation and Benchmarking Notion Claude Opus 4.6 Lovable +12 more

6arXiv · cs.AI·25d ago·source ↗

Claw-Anything: Benchmark for Always-On Personal Assistants with Broad Digital World Access

Claw-Anything is a new benchmark designed to evaluate LLM agents acting as always-on personal assistants with access to long-horizon activity histories, interdependent backend services, and multi-device GUI/CLI interaction. The benchmark simulates months of user activity to create complex, noisy world states and evaluates both reactive and proactive assistance. GPT-5.5 achieves only 34.5% pass@1, revealing a substantial capability gap versus prior narrower benchmarks. An accompanying automated data-generation pipeline produces 2,000 training environments and yields a 23.7% improvement over the base model.

Long Context Evolution Evaluation and Benchmarking multi-round event injection Claw-Anything large language model agents +3 more

4Github Trending·16d ago·source ↗

NVIDIA NemoClaw: Secure agent execution inside NVIDIA OpenShell with managed inference

NVIDIA has published NemoClaw, a TypeScript project on GitHub for running AI agents such as Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference. The repository has accumulated over 20,000 stars, suggesting notable community interest. The project appears to be part of NVIDIA's broader NeMo ecosystem for enterprise AI agent deployment.

Inference Economics Agent and Tool Ecosystem Hermes OpenClaw NVIDIA +2 more