Groq on Hugging Face Inference Providers
Hugging Face has added Groq as an inference provider in its Inference Providers ecosystem, allowing users to access Groq-hosted models directly through the Hugging Face platform. This integration enables developers to use Groq's LPU-based fast inference via the Hugging Face Hub interface and APIs. The partnership expands the multi-provider inference marketplace that Hugging Face has been building.
Related guides (3)
Related events (8)
Hugging Face Launches Inference Providers on the Hub
Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.
DeepInfra Added as Hugging Face Inference Provider
Hugging Face has added DeepInfra as an integrated inference provider on its platform. This expands the roster of third-party inference backends accessible directly through the Hugging Face ecosystem. The integration allows users to route model inference requests to DeepInfra's infrastructure via the standard Hugging Face Inference Providers interface.
Public AI on Hugging Face Inference Providers
Hugging Face announces the integration of Public AI as a new inference provider on its platform. This expands the ecosystem of third-party inference backends available through Hugging Face's unified API. The move continues the pattern of Hugging Face aggregating multiple inference providers to give developers flexible deployment options.
Cohere Models Now Available via Hugging Face Inference Providers
Hugging Face has added Cohere as an inference provider on its platform, enabling users to access Cohere models directly through the Hugging Face Inference API. This integration expands the Inference Providers ecosystem, which allows developers to run models from multiple vendors through a unified interface. The announcement reflects continued consolidation of model serving infrastructure across major AI providers.
Hugging Face Adds Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita
Hugging Face has expanded its serverless inference provider ecosystem by integrating three new partners: Hyperbolic, Nebius AI Studio, and Novita. These providers offer API-based inference for models hosted on the Hugging Face Hub, increasing the options available to developers for deploying open-weights models without managing infrastructure. The expansion reflects growing competition in the inference-as-a-service market targeting open-source AI workloads.
Hugging Face Launches Inference for PRO Subscribers
Hugging Face introduced a dedicated inference tier for PRO subscribers, providing access to powerful models via API without rate limits typical of free tiers. The offering targets developers and researchers who need reliable, higher-throughput access to hosted models. This represents a monetization and infrastructure expansion move by Hugging Face to serve professional users.
Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub
Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.
Scaleway Joins Hugging Face Inference Providers
Scaleway has been added as an inference provider on the Hugging Face platform, expanding the ecosystem of third-party compute options available to developers. This integration allows users to route model inference through Scaleway's infrastructure directly via Hugging Face's unified API. The announcement reflects continued growth of the Hugging Face inference provider program as a multi-cloud deployment layer for open-weights models.


