Almanac
← Events
4Hugging Face Blog·1mo ago

Intel and Hugging Face Partner to Democratize Machine Learning Hardware Acceleration

Intel and Hugging Face announced a partnership aimed at making hardware acceleration for machine learning more accessible. The collaboration focuses on optimizing Hugging Face models and tools to run efficiently on Intel hardware. This represents an early-stage industry alignment between a major chip manufacturer and the dominant open-source ML model hub.

Related guides (4)

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Hugging Face and AMD Partner to Accelerate Models on CPU and GPU Platforms

Hugging Face and AMD announced a partnership aimed at optimizing and accelerating state-of-the-art AI models across AMD's CPU and GPU hardware platforms. The collaboration targets improved performance for models hosted and distributed through Hugging Face's ecosystem. This represents a strategic move to broaden hardware support beyond NVIDIA-dominated infrastructure in the AI/ML deployment landscape.

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and Graphcore Partner for IPU-Optimized Transformers

Hugging Face and Graphcore announced a partnership to optimize Transformer models for Graphcore's Intelligence Processing Unit (IPU) hardware. The collaboration aims to make IPU-accelerated inference and training accessible through the Hugging Face ecosystem. This represents an early effort to broaden AI hardware options beyond GPU-dominated infrastructure.

5Hugging Face Blog·1mo ago·source ↗

Hugging Face on AMD Instinct MI300 GPU

Hugging Face announces support and optimization for AMD Instinct MI300 GPUs, expanding the ecosystem of hardware that can run Hugging Face models and tools. The post covers integration work enabling inference and training workloads on AMD's high-memory GPU accelerator. This represents a meaningful step in diversifying AI infrastructure beyond NVIDIA dominance.

4Hugging Face Blog·1mo ago·source ↗

Accelerating Hugging Face Transformers with AWS Inferentia2

Hugging Face published a blog post detailing how to accelerate Transformer model inference using AWS Inferentia2, Amazon's second-generation ML inference chip. The post covers integration patterns between the Hugging Face ecosystem and the Neuron SDK for deploying models on Inferentia2 hardware. This represents a practical guide for enterprise and cloud-based inference deployment using dedicated AI accelerators.

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub

Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.

4Hugging Face Blog·1mo ago·source ↗

Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training

Habana Labs and Hugging Face announced a partnership to accelerate transformer model training on Habana's Gaudi AI processors. The collaboration aims to integrate Hugging Face's Transformers library with Habana's hardware, offering an alternative to GPU-based training infrastructure. This represents an early effort to diversify the AI training hardware ecosystem beyond NVIDIA dominance.

6Hugging Face Blog·1mo ago·source ↗

Hugging Face and NVIDIA Launch Training Cluster as a Service

Hugging Face and NVIDIA are announcing a joint 'Training Cluster as a Service' offering, providing managed GPU cluster access for AI model training. The collaboration aims to lower the barrier for organizations to access large-scale training infrastructure without managing hardware directly. This represents a strategic partnership between a major AI platform and a leading GPU manufacturer to address enterprise training infrastructure needs.

5Hugging Face Blog·1mo ago·source ↗

Hugging Face Launches Kernel Hub for Custom GPU Kernels

Hugging Face has introduced the Kernel Hub, a centralized repository for sharing and discovering custom GPU kernels optimized for AI/ML workloads. The platform aims to make high-performance custom CUDA and Triton kernels more accessible to the broader ML community. This represents an infrastructure layer addition to the Hugging Face ecosystem, complementing its existing model and dataset hubs.