Call Me Almanac

3Hugging Face Blog·1mo ago

An Overview of Inference Solutions on Hugging Face

Hugging Face published a blog post surveying its inference product offerings as of late 2022. The post covers the range of hosted and API-based inference solutions available on the platform, aimed at helping developers choose appropriate deployment paths. This serves as a reference overview of Hugging Face's inference infrastructure ecosystem at that time.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Endpoints Hugging Face Inference API Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference Providers on the Hub

Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.

Open Weights Progress Inference Economics Inference Providers Hugging Face +2 more

3Hugging Face Blog·1mo ago·source ↗

Hugging Face Adds New Analytics Dashboard to Inference Endpoints

Hugging Face has released updated analytics features for its Inference Endpoints product, providing users with improved visibility into deployment metrics and usage patterns. The announcement covers new dashboards and monitoring capabilities for hosted model inference. This is a product update targeting enterprise and developer users running models on Hugging Face's managed inference infrastructure.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Endpoints Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Hugging Face Adds Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita

Hugging Face has expanded its serverless inference provider ecosystem by integrating three new partners: Hyperbolic, Nebius AI Studio, and Novita. These providers offer API-based inference for models hosted on the Hugging Face Hub, increasing the options available to developers for deploying open-weights models without managing infrastructure. The expansion reflects growing competition in the inference-as-a-service market targeting open-source AI workloads.

Inference Economics Agent and Tool Ecosystem Hyperbolic Novita Nebius AI Studio +1 more

4Hugging Face Blog·1mo ago·source ↗

Public AI on Hugging Face Inference Providers

Hugging Face announces the integration of Public AI as a new inference provider on its platform. This expands the ecosystem of third-party inference backends available through Hugging Face's unified API. The move continues the pattern of Hugging Face aggregating multiple inference providers to give developers flexible deployment options.

Inference Economics Agent and Tool Ecosystem Hugging Face Inference Providers Hugging Face Public AI

4Hugging Face Blog·1mo ago·source ↗

DeepInfra Added as Hugging Face Inference Provider

Hugging Face has added DeepInfra as an integrated inference provider on its platform. This expands the roster of third-party inference backends accessible directly through the Hugging Face ecosystem. The integration allows users to route model inference requests to DeepInfra's infrastructure via the standard Hugging Face Inference Providers interface.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Hugging Face DeepInfra

4Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference for PRO Subscribers

Hugging Face introduced a dedicated inference tier for PRO subscribers, providing access to powerful models via API without rate limits typical of free tiers. The offering targets developers and researchers who need reliable, higher-throughput access to hosted models. This represents a monetization and infrastructure expansion move by Hugging Face to serve professional users.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference API Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Featherless AI Joins Hugging Face Inference Providers

Hugging Face has added Featherless AI as a new inference provider in its Inference Providers ecosystem. Featherless AI specializes in serverless inference for open-weight models, expanding the range of third-party compute options available through the Hugging Face platform. This integration allows developers to route model inference requests to Featherless AI directly via the Hugging Face API and model hub.

Open Weights Progress Inference Economics Hugging Face Inference Providers Featherless AI Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Cohere Models Now Available via Hugging Face Inference Providers

Hugging Face has added Cohere as an inference provider on its platform, enabling users to access Cohere models directly through the Hugging Face Inference API. This integration expands the Inference Providers ecosystem, which allows developers to run models from multiple vendors through a unified interface. The announcement reflects continued consolidation of model serving infrastructure across major AI providers.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Cohere Hugging Face +1 more