4Hugging Face Blog·1mo ago

Hugging Face Adds Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita

Hugging Face has expanded its serverless inference provider ecosystem by integrating three new partners: Hyperbolic, Nebius AI Studio, and Novita. These providers offer API-based inference for models hosted on the Hugging Face Hub, increasing the options available to developers for deploying open-weights models without managing infrastructure. The expansion reflects growing competition in the inference-as-a-service market targeting open-source AI workloads.

Inference Economics Agent and Tool Ecosystem Hyperbolic Novita Nebius AI Studio Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference Providers on the Hub

Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.

Open Weights Progress Inference Economics Inference Providers Hugging Face +2 more

4Hugging Face Blog·1mo ago·source ↗

Featherless AI Joins Hugging Face Inference Providers

Hugging Face has added Featherless AI as a new inference provider in its Inference Providers ecosystem. Featherless AI specializes in serverless inference for open-weight models, expanding the range of third-party compute options available through the Hugging Face platform. This integration allows developers to route model inference requests to Featherless AI directly via the Hugging Face API and model hub.

Open Weights Progress Inference Economics Hugging Face Inference Providers Featherless AI Hugging Face +1 more

5Hugging Face Blog·1mo ago·source ↗

Serverless Inference with Hugging Face and NVIDIA NIM

Hugging Face and NVIDIA have partnered to offer serverless inference via NVIDIA NIM microservices on DGX Cloud infrastructure. The integration allows developers to run optimized model inference without managing GPU infrastructure, combining Hugging Face's model hub with NVIDIA's inference optimization stack. This represents an expansion of the existing Hugging Face–NVIDIA partnership into managed inference services.

Training Infrastructure Inference Economics NVIDIA NIM NVIDIA DGX Cloud +2 more

4Hugging Face Blog·1mo ago·source ↗

Public AI on Hugging Face Inference Providers

Hugging Face announces the integration of Public AI as a new inference provider on its platform. This expands the ecosystem of third-party inference backends available through Hugging Face's unified API. The move continues the pattern of Hugging Face aggregating multiple inference providers to give developers flexible deployment options.

Inference Economics Agent and Tool Ecosystem Hugging Face Inference Providers Hugging Face Public AI

5Hugging Face Blog·1mo ago·source ↗

Bringing Serverless GPU Inference to Hugging Face Users via Cloudflare Workers AI

Hugging Face and Cloudflare have partnered to bring serverless GPU inference to Hugging Face users through Cloudflare Workers AI. The integration allows developers to run Hugging Face models on Cloudflare's global edge network without managing GPU infrastructure. This represents an expansion of serverless inference options for the Hugging Face ecosystem, lowering the barrier to deploying ML models at scale.

Inference Economics Enterprise Deployment Patterns Cloudflare Workers AI Hugging Face Cloudflare +1 more

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub

Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.

Inference Economics Enterprise Deployment Patterns FriendliAI Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Cohere Models Now Available via Hugging Face Inference Providers

Hugging Face has added Cohere as an inference provider on its platform, enabling users to access Cohere models directly through the Hugging Face Inference API. This integration expands the Inference Providers ecosystem, which allows developers to run models from multiple vendors through a unified interface. The announcement reflects continued consolidation of model serving infrastructure across major AI providers.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Cohere Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

DeepInfra Added as Hugging Face Inference Provider

Hugging Face has added DeepInfra as an integrated inference provider on its platform. This expands the roster of third-party inference backends accessible directly through the Hugging Face ecosystem. The integration allows users to route model inference requests to DeepInfra's infrastructure via the standard Hugging Face Inference Providers interface.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Hugging Face DeepInfra