5 · Hugging Face Blog · 1mo ago

Holotron-12B - High Throughput Computer Use Agent

H Company has released Holotron-12B, a 12-billion-parameter model designed for computer-use agent tasks, with a focus on high throughput. The model was announced on the Hugging Face blog, which suggests it is available, or will soon be available, on the platform. The provided body text gives no details on architecture, benchmarks, or capabilities.

Frontier Model Releases Inference Economics Agent and Tool Ecosystem Hugging Face Hcompany Holotron-12B

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Related events (8)

6 · Hugging Face Blog · 18d ago

H Company releases Holo3.1: fast local computer use agent model

H Company published a Hugging Face blog post announcing Holo3.1, a model designed for computer use agents that runs locally. The release targets fast, on-device computer control tasks, positioning it in the growing space of open/local agentic models. The body content is minimal, but the announcement signals a new entrant in the local computer-use agent category.

Open Weights Progress Agent and Tool Ecosystem Holo3.1 H Company

5 · Hugging Face Blog · 1mo ago

H Company's Holo2 235B-A22B Model Leads in UI Localization

H Company has released Holo2, a 235B-parameter mixture-of-experts model with 22B active parameters, announced via the Hugging Face blog. The model is positioned as a leader in UI localization tasks, suggesting a focus on agent-oriented or multimodal UI understanding. The post appears to be a model introduction from H Company, a relatively new AI lab.
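
As a rough illustration of what the "235B-A22B" naming implies, the share of parameters that are active can be computed directly from the two figures in the summary. This is only arithmetic on the stated numbers, on the usual reading of "active parameters" as the subset used per token; the variable names are illustrative and nothing here is taken from H Company's post.

```python
# Figures from the summary above, in billions of parameters.
total_params_b = 235.0
active_params_b = 22.0

# Fraction of the model's parameters that are active (assumed: per token).
active_fraction = active_params_b / total_params_b

print(f"Active share: {active_fraction:.1%}")  # prints "Active share: 9.4%"
```

So under that reading, a bit under a tenth of the weights take part in any single forward step, which is why such models are often described by both numbers.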

Frontier Model Releases Agent and Tool Ecosystem Hugging Face Holo2 H Company +1 more

5 · Hugging Face Blog · 1mo ago

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

H Company has released Holo1, a new family of vision-language models specifically designed for GUI automation tasks. These models power Surfer-H, a GUI agent capable of interacting with graphical interfaces. The release represents a specialized VLM family targeting the agent-tool ecosystem for desktop/web automation. Details on architecture, training data, and benchmarks are expected in the accompanying blog post.

Agent and Tool Ecosystem Multimodal Progress Surfer-H Hugging Face Holo1 +1 more

6 · GitHub Trending · 3d ago

Microsoft releases Fara-7B, an efficient agentic model for computer use

Microsoft has published Fara-7B, a 7-billion-parameter model designed for agentic computer-use tasks, available on GitHub. The repository has accumulated 5,834 stars, with 97 added today, suggesting notable community interest. The model targets efficient computer-use agent workflows, a competitive area that also includes Claude's computer use and similar offerings.

Frontier Model Releases Open Weights Progress Microsoft Fara-7B +1 more

8 · Google DeepMind Blog · 1mo ago

Introducing the Gemini 2.5 Computer Use model

Google DeepMind has released a preview of a specialized Computer Use model built on Gemini 2.5 Pro, available via API. The model is designed to power agents that can interact with user interfaces, extending Gemini 2.5 Pro's capabilities into computer-use agentic tasks. This positions Google as a direct competitor to Anthropic's Claude Computer Use and similar offerings in the emerging computer-use agent space.

Frontier Model Releases Agent and Tool Ecosystem Google DeepMind Gemini-2.5-Pro Gemini 2.5 Computer Use +1 more

9 · Anthropic News · 17d ago

Anthropic introduces computer use capability, upgraded Claude 3.5 Sonnet, and Claude 3.5 Haiku

Anthropic announced three developments. First, an upgraded Claude 3.5 Sonnet with significant coding improvements: SWE-bench Verified rises from 33.4% to 49.0%, surpassing all publicly available models, including reasoning models. Second, a new Claude 3.5 Haiku that matches Claude 3 Opus performance at Haiku-tier speed. Third, a public beta of 'computer use', a capability that lets Claude control computers by viewing screens, moving cursors, clicking, and typing. Computer use is available via the Anthropic API, Amazon Bedrock, and Google Cloud Vertex AI, with early adopters including Replit, The Browser Company, and Cognition. The US and UK AI Safety Institutes (US AISI and UK AISI) both conducted pre-deployment testing, and the model was assessed as remaining within ASL-2 under Anthropic's Responsible Scaling Policy.

Frontier Model Releases Evaluation and Benchmarking OpenAI o1-preview Amazon Bedrock Claude 3.5 Sonnet +15 more

8 · Anthropic News · 18d ago

Anthropic Releases Computer Use Capability for Claude 3.5 Sonnet

Anthropic has launched a public beta of computer use for Claude 3.5 Sonnet, enabling the model to control a computer by interpreting screenshots and issuing pixel-level cursor and keyboard commands. The model achieves 14.9% on the OSWorld benchmark, roughly double the next-best AI model's 7.7%, though well below human-level performance of 70-75%. Anthropic trained the model on a small set of simple software tools and found it generalized rapidly to broader computer interaction. Safety analysis confirmed the capability remains at AI Safety Level 2, with prompt injection identified as a primary near-term risk.
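
The screenshot-in, action-out loop described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Anthropic's API or implementation: `FakeDesktop`, `scripted_policy`, `Action`, and `run_agent` are hypothetical names, the "screenshot" is a text stub rather than pixels, and a fixed script stands in for the model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    kind: str       # "click", "type", or "done"
    x: int = 0      # pixel coordinates, used by "click"
    y: int = 0
    text: str = ""  # characters to enter, used by "type"


class FakeDesktop:
    """Hypothetical stand-in for a real screen; it only records actions."""

    def __init__(self) -> None:
        self.log: List[str] = []

    def screenshot(self) -> str:
        # A real agent would capture pixels; here it is a text summary.
        return f"screen after {len(self.log)} actions"

    def apply(self, action: Action) -> None:
        if action.kind == "click":
            self.log.append(f"click({action.x},{action.y})")
        elif action.kind == "type":
            self.log.append(f"type({action.text!r})")


def scripted_policy(screenshot: str, step: int) -> Action:
    """Hypothetical policy standing in for the model: a fixed three-step plan."""
    plan = [
        Action("click", x=120, y=48),
        Action("type", text="hello"),
        Action("done"),
    ]
    return plan[min(step, len(plan) - 1)]


def run_agent(
    desktop: FakeDesktop,
    policy: Callable[[str, int], Action],
    max_steps: int = 10,
) -> List[str]:
    """Observe the screen, ask the policy for one action, apply it, repeat."""
    for step in range(max_steps):
        action = policy(desktop.screenshot(), step)
        if action.kind == "done":
            break
        desktop.apply(action)
    return desktop.log


print(run_agent(FakeDesktop(), scripted_policy))
```

The `max_steps` cap is a design choice for the sketch: a loop driven by model output needs some bound so that a policy that never emits "done" cannot run indefinitely.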

Evaluation and Benchmarking AI Safety Research prompt injection Claude 3.5 Sonnet Responsible Scaling Policy +6 more

6 · Hugging Face Blog · 1mo ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

TII UAE has released Falcon-H1, a new family of hybrid-head language models combining attention and state-space mechanisms to improve efficiency and performance. The models are published on Hugging Face and represent TII's latest iteration in the Falcon series. The hybrid architecture targets better inference economics and competitive benchmark results relative to model size.

Frontier Model Releases Open Weights Progress Hugging Face Hybrid-Head Architecture Falcon-H1 +2 more