Hugging Face Launches Kernel Hub for Custom GPU Kernels
Hugging Face has introduced the Kernel Hub, a centralized repository for sharing and discovering custom GPU kernels optimized for AI/ML workloads. The platform aims to make high-performance custom CUDA and Triton kernels more accessible to the broader ML community. This represents an infrastructure layer addition to the Hugging Face ecosystem, complementing its existing model and dataset hubs.
Related guides (4)
Related events (8)
Easily Build and Share ROCm Kernels with Hugging Face
Hugging Face has published a guide and tooling for building and sharing custom ROCm kernels on its platform, targeting AMD GPU users in the ML ecosystem. The post covers the workflow for packaging, uploading, and reusing ROCm-based GPGPU kernels via the Hub. This lowers the barrier for AMD GPU kernel development and sharing, complementing the existing CUDA-centric kernel ecosystem. The initiative is relevant to inference optimization and the broader push to diversify GPU hardware support in AI workloads.
Hugging Face and AMD Partner to Accelerate Models on CPU and GPU Platforms
Hugging Face and AMD announced a partnership aimed at optimizing and accelerating state-of-the-art AI models across AMD's CPU and GPU hardware platforms. The collaboration targets improved performance for models hosted and distributed through Hugging Face's ecosystem. This represents a strategic move to broaden hardware support beyond NVIDIA-dominated infrastructure in the AI/ML deployment landscape.
Announcing New Hugging Face and KerasHub Integration
Hugging Face and KerasHub have announced a new integration enabling users to access Hugging Face models and datasets directly through the Keras ecosystem. This partnership bridges two major ML frameworks, allowing Keras users to leverage the Hugging Face Hub's model repository without leaving the Keras workflow. The integration is aimed at reducing friction for practitioners who prefer Keras-based training and inference pipelines.
Hugging Face and NVIDIA Launch Training Cluster as a Service
Hugging Face and NVIDIA are announcing a joint 'Training Cluster as a Service' offering, providing managed GPU cluster access for AI model training. The collaboration aims to lower the barrier for organizations to access large-scale training infrastructure without managing hardware directly. This represents a strategic partnership between a major AI platform and a leading GPU manufacturer to address enterprise training infrastructure needs.
Hugging Face on AMD Instinct MI300 GPU
Hugging Face announces support and optimization for AMD Instinct MI300 GPUs, expanding the ecosystem of hardware that can run Hugging Face models and tools. The post covers integration work enabling inference and training workloads on AMD's high-memory GPU accelerator. This represents a meaningful step in diversifying AI infrastructure beyond NVIDIA dominance.
Intel and Hugging Face Partner to Democratize Machine Learning Hardware Acceleration
Intel and Hugging Face announced a partnership aimed at making hardware acceleration for machine learning more accessible. The collaboration focuses on optimizing Hugging Face models and tools to run efficiently on Intel hardware. This represents an early-stage industry alignment between a major chip manufacturer and the dominant open-source ML model hub.
Improving Hugging Face Model Access for Kaggle Users
Hugging Face has announced an integration improvement that streamlines how Kaggle users access models from the Hugging Face Hub. The update appears to reduce friction for practitioners using Kaggle notebooks and compute environments to work with Hugging Face-hosted models. This represents a platform-level partnership move between two major ML community hubs.
Hugging Face Launches Inference Providers on the Hub
Hugging Face has introduced Inference Providers on the Hub, a feature that allows users to run models hosted on the Hub through third-party inference providers directly from the platform. This integration consolidates access to multiple inference backends under a unified interface, reducing friction for developers who want to deploy or test models at scale. The announcement positions Hugging Face as a marketplace layer connecting model authors with inference infrastructure providers.



