5Hugging Face Blog·1mo ago

Xet Storage Integration on Hugging Face Hub

Hugging Face has integrated Xet, a chunk-based deduplication storage backend, into the Hub to improve large model file storage and transfer efficiency. The integration aims to reduce redundant data storage and speed up uploads/downloads for large model weights by splitting files into content-addressed chunks. This is an infrastructure improvement relevant to the open-weights ecosystem where multi-gigabyte model files are common.

Open Weights Progress Inference Economics Xet Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Migrating the Hugging Face Hub from Git LFS to Xet

Hugging Face is migrating its model and dataset hosting infrastructure from Git LFS to Xet, a content-addressed storage system designed for large binary files. The migration aims to improve upload/download speeds, deduplication, and storage efficiency for the large model weights and datasets hosted on the Hub. This represents a significant infrastructure change affecting how millions of AI artifacts are stored and accessed by the community.

Training Infrastructure Inference Economics Xet Git LFS Hugging Face

5Hugging Face Blog·1mo ago·source ↗

XetHub Joins Hugging Face

XetHub, a company specializing in large-scale file storage and versioning for ML datasets and models, is being acquired by Hugging Face. The acquisition is intended to strengthen Hugging Face's infrastructure for hosting and managing large model and dataset repositories. This move reflects ongoing consolidation in the AI tooling and infrastructure space around the Hugging Face platform.

Training Infrastructure Agent and Tool Ecosystem XetHub Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face is launching Storage Buckets, a new feature on the Hub that provides object storage capabilities for AI/ML workflows. This expands the Hub's infrastructure offerings beyond model and dataset repositories, enabling users to store arbitrary files and artifacts. The feature targets teams managing large-scale AI pipelines who need integrated storage alongside their models and datasets.

Enterprise Deployment Patterns Agent and Tool Ecosystem Hugging Face Storage Buckets

5Hugging Face Blog·1mo ago·source ↗

Announcing New Hugging Face and KerasHub Integration

Hugging Face and KerasHub have announced a new integration enabling users to access Hugging Face models and datasets directly through the Keras ecosystem. This partnership bridges two major ML frameworks, allowing Keras users to leverage the Hugging Face Hub's model repository without leaving the Keras workflow. The integration is aimed at reducing friction for practitioners who prefer Keras-based training and inference pipelines.

Enterprise Deployment Patterns Agent and Tool Ecosystem Keras KerasHub Hugging Face

6Hugging Face Blog·25d ago·source ↗

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Hugging Face introduces Delta Weight Sync in TRL, a technique for efficiently synchronizing model weight updates during large-scale training by transmitting only the delta (difference) between checkpoints rather than full parameter snapshots. The approach targets trillion-parameter training regimes where checkpoint bandwidth is a significant bottleneck. The post describes integration with the Hugging Face Hub as a storage and distribution layer for these delta updates.

Training Infrastructure Inference Economics Hugging Face Delta Weight Sync TRL

4Hugging Face Blog·1mo ago·source ↗

Improving Hugging Face Model Access for Kaggle Users

Hugging Face has announced an integration improvement that streamlines how Kaggle users access models from the Hugging Face Hub. The update appears to reduce friction for practitioners using Kaggle notebooks and compute environments to work with Hugging Face-hosted models. This represents a platform-level partnership move between two major ML community hubs.

Enterprise Deployment Patterns Agent and Tool Ecosystem Kaggle Hugging Face

6Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Inference Providers on the Hub

Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.

Open Weights Progress Inference Economics Inference Providers Hugging Face +2 more

5Hugging Face Blog·1mo ago·source ↗

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

Hugging Face announced integration with NVIDIA DGX Cloud, enabling users to train models on H100 GPU clusters directly through the Hugging Face platform. The partnership simplifies access to high-end training infrastructure without requiring users to manage cloud provisioning themselves. This represents a continued push to lower the barrier to large-scale model training for the broader ML community.

Training Infrastructure Inference Economics NVIDIA NVIDIA DGX Cloud H100 +2 more