Call Me Almanac

4Hugging Face Blog·1mo ago

Groq on Hugging Face Inference Providers

Hugging Face has added Groq as an inference provider in its Inference Providers ecosystem, allowing users to access Groq-hosted models directly through the Hugging Face platform. This integration enables developers to use Groq's LPU-based fast inference via the Hugging Face Hub interface and APIs. The partnership expands the multi-provider inference marketplace that Hugging Face has been building.

Inference Economics Agent and Tool Ecosystem Hugging Face Inference Providers LPU Hugging Face Groq

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference Providers on the Hub

Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.

Open Weights Progress Inference Economics Inference Providers Hugging Face +2 more

4Hugging Face Blog·1mo ago·source ↗

DeepInfra Added as Hugging Face Inference Provider

Hugging Face has added DeepInfra as an integrated inference provider on its platform. This expands the roster of third-party inference backends accessible directly through the Hugging Face ecosystem. The integration allows users to route model inference requests to DeepInfra's infrastructure via the standard Hugging Face Inference Providers interface.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Hugging Face DeepInfra

4Hugging Face Blog·1mo ago·source ↗

Public AI on Hugging Face Inference Providers

Hugging Face announces the integration of Public AI as a new inference provider on its platform. This expands the ecosystem of third-party inference backends available through Hugging Face's unified API. The move continues the pattern of Hugging Face aggregating multiple inference providers to give developers flexible deployment options.

Inference Economics Agent and Tool Ecosystem Hugging Face Inference Providers Hugging Face Public AI

4Hugging Face Blog·1mo ago·source ↗

Cohere Models Now Available via Hugging Face Inference Providers

Hugging Face has added Cohere as an inference provider on its platform, enabling users to access Cohere models directly through the Hugging Face Inference API. This integration expands the Inference Providers ecosystem, which allows developers to run models from multiple vendors through a unified interface. The announcement reflects continued consolidation of model serving infrastructure across major AI providers.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Cohere Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Hugging Face Adds Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita

Hugging Face has expanded its serverless inference provider ecosystem by integrating three new partners: Hyperbolic, Nebius AI Studio, and Novita. These providers offer API-based inference for models hosted on the Hugging Face Hub, increasing the options available to developers for deploying open-weights models without managing infrastructure. The expansion reflects growing competition in the inference-as-a-service market targeting open-source AI workloads.

Inference Economics Agent and Tool Ecosystem Hyperbolic Novita Nebius AI Studio +1 more

4Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference for PRO Subscribers

Hugging Face introduced a dedicated inference tier for PRO subscribers, providing access to powerful models via API without rate limits typical of free tiers. The offering targets developers and researchers who need reliable, higher-throughput access to hosted models. This represents a monetization and infrastructure expansion move by Hugging Face to serve professional users.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference API Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub

Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.

Inference Economics Enterprise Deployment Patterns FriendliAI Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Scaleway Joins Hugging Face Inference Providers

Scaleway has been added as an inference provider on the Hugging Face platform, expanding the ecosystem of third-party compute options available to developers. This integration allows users to route model inference through Scaleway's infrastructure directly via Hugging Face's unified API. The announcement reflects continued growth of the Hugging Face inference provider program as a multi-cloud deployment layer for open-weights models.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Providers Hugging Face Scaleway