4Hugging Face Blog·1mo ago

Deploy Hugging Face Models Easily with Amazon SageMaker

Hugging Face and Amazon SageMaker announced an integration enabling streamlined deployment of Hugging Face models via SageMaker's managed infrastructure. The partnership provides dedicated Hugging Face Deep Learning Containers on AWS, simplifying the path from model hub to production inference. This represents an early milestone in the enterprise deployment pattern of hosted model hubs integrating with cloud ML platforms.

Inference Economics Enterprise Deployment Patterns Amazon SageMaker Hugging Face Deep Learning Containers Hugging Face Amazon Web Services

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Amazon Web Services

Amazon Web Services: The Cloud Backbone of the AI Era

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Read asIn-depth

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Read asIn-depth

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

The Partnership: Amazon SageMaker and Hugging Face

Hugging Face and Amazon announced a partnership integrating Hugging Face models and tools natively into Amazon SageMaker. This collaboration enables developers to train and deploy Hugging Face Transformers models directly within SageMaker's managed ML infrastructure. The partnership represents an early major cloud-provider integration for Hugging Face, expanding enterprise access to open-source NLP models.

Enterprise Deployment Patterns Agent and Tool Ecosystem Amazon SageMaker Hugging Face Transformers Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Introducing the Hugging Face Embedding Container for Amazon SageMaker

Hugging Face has launched a dedicated embedding container for Amazon SageMaker, enabling streamlined deployment of text embedding models on AWS infrastructure. The container is designed to simplify production deployment of embedding models for use cases like semantic search and retrieval-augmented generation. This represents a deeper integration between Hugging Face's model ecosystem and AWS's managed ML platform.

Inference Economics Enterprise Deployment Patterns Amazon SageMaker Hugging Face Hugging Face Embedding Container +1 more

6Hugging Face Blog·1mo ago·source ↗

Hugging Face and AWS Partner to Make AI More Accessible

Hugging Face announced a strategic partnership with Amazon Web Services to expand access to AI models and tools. The collaboration aims to integrate Hugging Face's model hub and libraries more deeply with AWS infrastructure and services. This represents a significant enterprise deployment and cloud distribution move for the open-source AI ecosystem.

Open Weights Progress Inference Economics Hugging Face Amazon Web Services +1 more

5Hugging Face Blog·1mo ago·source ↗

Introducing the Hugging Face LLM Inference Container for Amazon SageMaker

Hugging Face and Amazon Web Services have launched a dedicated LLM inference container for Amazon SageMaker, enabling optimized deployment of large language models on managed cloud infrastructure. The container is built on Hugging Face's Text Generation Inference (TGI) toolkit, which supports features like continuous batching, tensor parallelism, and quantization. This integration lowers the barrier for enterprise teams to deploy open-weight LLMs at scale on AWS without managing custom serving infrastructure.

Open Weights Progress Inference Economics Text Generation Inference Amazon SageMaker tensor parallelism +4 more

5Hugging Face Blog·1mo ago·source ↗

Hugging Face Models Now Available in Amazon Bedrock Marketplace

Hugging Face has announced that its models are now accessible through Amazon Bedrock's model marketplace, enabling AWS customers to deploy Hugging Face models via Bedrock's managed infrastructure. This integration allows enterprise users to access open-weight and proprietary Hugging Face models without managing their own inference infrastructure. The partnership expands the distribution channel for Hugging Face models into AWS's enterprise customer base.

Open Weights Progress Inference Economics Amazon Bedrock Hugging Face Amazon Web Services +1 more

6Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference Providers on the Hub

Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.

Open Weights Progress Inference Economics Inference Providers Hugging Face +2 more

4Hugging Face Blog·1mo ago·source ↗

Improving Hugging Face Model Access for Kaggle Users

Hugging Face has announced an integration improvement that streamlines how Kaggle users access models from the Hugging Face Hub. The update appears to reduce friction for practitioners using Kaggle notebooks and compute environments to work with Hugging Face-hosted models. This represents a platform-level partnership move between two major ML community hubs.

Enterprise Deployment Patterns Agent and Tool Ecosystem Kaggle Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub

Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.

Inference Economics Enterprise Deployment Patterns FriendliAI Hugging Face +1 more