3GitHub Trending (AI/LLM filtered)·24d ago

NVIDIA NeMo Megatron-Bridge: Bidirectional Hugging Face Conversion for Megatron-Based Training

Megatron-Bridge is an NVIDIA NeMo training library for Megatron-based models that supports bidirectional conversion between Megatron and Hugging Face formats. The repository has accumulated 670 stars with modest daily growth (+5). It addresses a practical interoperability gap between the high-performance Megatron training stack and the broader HuggingFace ecosystem.

Training Infrastructure Agent and Tool Ecosystem NVIDIA NeMo Hugging Face Megatron-Bridge Megatron-LM

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Training InfrastructureTopic guide

Training Infrastructure: The Compute Arms Race Powering Modern AI

Read asBeginner In-depth

Related events (8)

7Hugging Face Blog·1mo ago·source ↗

Transformers v5: Simple model definitions powering the AI ecosystem

Hugging Face has announced Transformers v5, a major version update to its flagship open-source library. The release focuses on simplified model definitions and architectural improvements to the codebase. As one of the most widely used ML libraries in the ecosystem, this update has broad implications for researchers and practitioners building on top of the Transformers framework.

Open Weights Progress Inference Economics Transformers Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Optimum + ONNX Runtime: Faster Training for Hugging Face Models

Hugging Face's Optimum library integrates with Microsoft's ONNX Runtime Training to accelerate fine-tuning of transformer models. The integration aims to reduce training time and memory usage with minimal code changes for practitioners using the Hugging Face ecosystem. This tooling update targets enterprise and research users looking to optimize training efficiency on existing hardware.

Training Infrastructure Agent and Tool Ecosystem Optimum Microsoft ONNX +1 more

4Hugging Face Blog·1mo ago·source ↗

Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training

Habana Labs and Hugging Face announced a partnership to accelerate transformer model training on Habana's Gaudi AI processors. The collaboration aims to integrate Hugging Face's Transformers library with Habana's hardware, offering an alternative to GPU-based training infrastructure. This represents an early effort to diversify the AI training hardware ecosystem beyond NVIDIA dominance.

Training Infrastructure Inference Economics Habana Labs Gaudi Hugging Face Transformers +2 more

4Hugging Face Blog·1mo ago·source ↗

Accelerating Hugging Face Transformers with AWS Inferentia2

Hugging Face published a blog post detailing how to accelerate Transformer model inference using AWS Inferentia2, Amazon's second-generation ML inference chip. The post covers integration patterns between the Hugging Face ecosystem and the Neuron SDK for deploying models on Inferentia2 hardware. This represents a practical guide for enterprise and cloud-based inference deployment using dedicated AI accelerators.

Training Infrastructure Inference Economics AWS Inferentia2 Hugging Face Transformers Hugging Face +3 more

5Hugging Face Blog·1mo ago·source ↗

Sentence Transformers Joins Hugging Face

Sentence Transformers, a widely-used library for generating sentence embeddings and semantic similarity, is officially joining Hugging Face. This integration brings the popular embedding framework under the Hugging Face ecosystem, likely enabling tighter integration with the Hub, datasets, and other HF tooling. The move consolidates a key component of the NLP/embedding pipeline within the dominant open-source AI platform.

Enterprise Deployment Patterns Agent and Tool Ecosystem Hugging Face Sentence Transformers

3Hugging Face Blog·1mo ago·source ↗

Training a Language Model with Hugging Face Transformers Using TensorFlow and TPUs

This Hugging Face blog post provides a technical walkthrough for training a language model using TensorFlow and Google TPUs via the Transformers library. It covers the practical setup, data pipeline, and training configuration required to leverage TPU hardware with the TF ecosystem. The post serves as a tutorial bridging Hugging Face tooling with TPU-based infrastructure.

Training Infrastructure Agent and Tool Ecosystem Google TPU Hugging Face Transformers Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

The Transformers Library: Standardizing Model Definitions

Hugging Face published a blog post outlining their approach to standardizing model definitions within the Transformers library. The post addresses how the library structures and maintains model code to ensure consistency, reproducibility, and ease of integration across a wide range of architectures. This is a tooling and ecosystem development relevant to practitioners building on or contributing to the Transformers framework.

Open Weights Progress Agent and Tool Ecosystem Transformers Hugging Face

6Hugging Face Blog·1mo ago·source ↗

A Gentle Introduction to 8-bit Matrix Multiplication for Transformers at Scale using Hugging Face and bitsandbytes

This Hugging Face blog post introduces 8-bit quantization for large transformer models via integration of the bitsandbytes library with the transformers and accelerate libraries. It explains how LLM.int8() enables loading large models in 8-bit precision, significantly reducing GPU memory requirements without major accuracy degradation. The post covers the technical mechanics of mixed-precision decomposition and how practitioners can use the integration in practice.

Training Infrastructure Open Weights Progress Transformers Tim Dettmers Accelerate +4 more