H Company's Holo2 235B-A22B Model Leads in UI Localization
H Company has released Holo2, a 235B parameter mixture-of-experts model with 22B active parameters, announced via the Hugging Face blog. The model is positioned as a leader in UI localization tasks, suggesting a focus on agent-oriented or multimodal UI understanding capabilities. The post appears to be a product/model introduction from H Company, a relatively newer AI lab.
Related guides (4)
Related events (8)
H Company releases Holo3.1: fast local computer use agent model
H Company published a Hugging Face blog post announcing Holo3.1, a model designed for computer use agents that runs locally. The release targets fast, on-device computer control tasks, positioning it in the growing space of open/local agentic models. The body content is minimal, but the announcement signals a new entrant in the local computer-use agent category.
Holotron-12B - High Throughput Computer Use Agent
Hcompany has released Holotron-12B, a 12-billion parameter model designed for computer use agent tasks with a focus on high throughput. The model is announced via the Hugging Face blog, suggesting it is available or soon available on the platform. Details on architecture, benchmarks, and capabilities are not present in the provided body text.
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H
H Company has released Holo1, a new family of vision-language models specifically designed for GUI automation tasks. These models power Surfer-H, a GUI agent capable of interacting with graphical interfaces. The release represents a specialized VLM family targeting the agent-tool ecosystem for desktop/web automation. Details on architecture, training data, and benchmarks are expected in the accompanying blog post.
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
This Hugging Face blog post provides a technical guide for fine-tuning Microsoft's Florence-2 vision-language models. Florence-2 is a compact yet capable multimodal model supporting tasks like captioning, object detection, and OCR. The post covers practical implementation details for adapting the model to custom datasets using the Hugging Face ecosystem.
Smol2Operator: Post-Training GUI Agents for Computer Use
Hugging Face published a blog post introducing Smol2Operator, a post-training approach for building GUI agents capable of computer use tasks. The work focuses on training small language models to operate graphical user interfaces, extending the SmolLM2 model family into the agent/computer-use domain. The post likely covers training methodology, datasets, and evaluation of the resulting GUI agent capabilities.
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
TII UAE has released Falcon-H1, a new family of hybrid-head language models combining attention and state-space mechanisms to improve efficiency and performance. The models are published on Hugging Face and represent TII's latest iteration in the Falcon series. The hybrid architecture targets better inference economics and competitive benchmark results relative to model size.
Meta releases Llama 4 Maverick 17B-128E multimodal instruct model on Hugging Face
Meta released Llama 4 Maverick, a 17B active parameter model with 128 experts (MoE architecture), as an image-text-to-text instruct model on Hugging Face. The model supports multimodal inputs and multiple languages including Arabic, German, and English. With 28K+ downloads and 493 likes shortly after release, it is seeing significant early adoption.
Introducing HUGS - Scale your AI with Open Models
Hugging Face announced HUGS (Hugging Face Generative Services), a new product aimed at helping enterprises scale AI deployments using open models. The service appears to target production inference infrastructure for open-weight models, positioning Hugging Face as a managed deployment layer. This is a product launch in the enterprise AI infrastructure space, competing with managed inference offerings from other providers.



