Introducing HUGS - Scale your AI with Open Models
Hugging Face announced HUGS (Hugging Face Generative AI Services), a new product aimed at helping enterprises scale AI deployments using open models. The service appears to target production inference infrastructure for open-weight models, positioning Hugging Face as a managed deployment layer. The launch places Hugging Face in the enterprise AI infrastructure market, in competition with managed inference offerings from other providers.
Hugging Face Launches Inference Providers on the Hub
Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.
Hugging Face and Google Partner for Open AI Collaboration
Hugging Face and Google have announced a partnership focused on open AI collaboration, expanding access to Hugging Face models and tools on Google Cloud Platform. The deal deepens integration between Hugging Face's model hub and Google's cloud infrastructure, enabling easier deployment of open-source models via GCP services. This follows a pattern of major cloud providers forming strategic alliances with leading open-source AI platforms.
Hugging Face and AWS Partner to Make AI More Accessible
Hugging Face announced a strategic partnership with Amazon Web Services to expand access to AI models and tools. The collaboration aims to integrate Hugging Face's model hub and libraries more deeply with AWS infrastructure and services. This represents a significant enterprise deployment and cloud distribution move for the open-source AI ecosystem.
Hugging Face Adds Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita
Hugging Face has expanded its serverless inference provider ecosystem by integrating three new partners: Hyperbolic, Nebius AI Studio, and Novita. These providers offer API-based inference for models hosted on the Hugging Face Hub, giving developers more options for deploying open-weight models without managing infrastructure. The expansion reflects growing competition in the inference-as-a-service market targeting open-source AI workloads.
Hugging Face and Google Cloud Announce New Partnership
Hugging Face has announced a new partnership with Google Cloud, framed around building an open AI future. The blog post describes a collaboration between the two organizations, but its details are not available here. The partnership likely involves deeper integration of Hugging Face's open-weight model hub and tooling with Google Cloud's infrastructure and services.
Public AI on Hugging Face Inference Providers
Hugging Face has announced the integration of Public AI as a new inference provider on its platform. This expands the ecosystem of third-party inference backends available through Hugging Face's unified API. The move continues the pattern of Hugging Face aggregating multiple inference providers to give developers flexible deployment options.
Hugging Face and NVIDIA Launch Training Cluster as a Service
Hugging Face and NVIDIA have announced a joint 'Training Cluster as a Service' offering, providing managed GPU cluster access for AI model training. The collaboration aims to lower the barrier for organizations to access large-scale training infrastructure without managing hardware directly. This represents a strategic partnership between a major AI platform and a leading GPU manufacturer to address enterprise training infrastructure needs.
CUGA on Hugging Face: Democratizing Configurable AI Agents
IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face, positioning it as a framework for building configurable AI agents. The announcement appears on the Hugging Face blog as a post from IBM Research. Details on architecture, benchmarks, and specific capabilities are not available here.



