3Hugging Face Blog·1mo ago

Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2

This Hugging Face blog post covers the deployment and acceleration of BridgeTower, a vision-language model, on Intel's Habana Gaudi2 AI accelerator hardware. The piece likely benchmarks inference throughput and training performance on Gaudi2 compared to other hardware. It represents a practical infrastructure and deployment case study for multimodal models on alternative AI accelerators.

Training Infrastructure Inference Economics Multimodal Progress BridgeTower Habana Gaudi Hugging Face Intel

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner

Training InfrastructureTopic guide

Training Infrastructure: The Compute Arms Race Powering Modern AI

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Read asIn-depth

Related events (8)

4Hugging Face Blog·1mo ago·source ↗

Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator

This Hugging Face blog post covers deploying BLOOMZ, a large multilingual language model, on Intel's Habana Gaudi2 accelerator for inference. It benchmarks throughput and latency performance on Gaudi2 as an alternative to GPU-based inference. The post is part of ongoing work to demonstrate non-NVIDIA hardware options for large model deployment.

Training Infrastructure Open Weights Progress BLOOMZ Habana Gaudi BLOOM +3 more

4Hugging Face Blog·1mo ago·source ↗

Getting Started with Transformers on Habana Gaudi

This Hugging Face blog post introduces integration between the Transformers library and Habana Gaudi AI accelerators. It provides a practical guide for running transformer model training and inference on Gaudi hardware as an alternative to GPU-based infrastructure. The post signals growing ecosystem support for non-NVIDIA AI accelerator hardware.

Training Infrastructure Inference Economics Transformers Habana Gaudi Hugging Face +1 more

3Hugging Face Blog·1mo ago·source ↗

Accelerating Protein Language Model ProtST on Intel Gaudi 2

A Hugging Face blog post details the acceleration of ProtST, a protein language model, on Intel's Gaudi 2 AI accelerator hardware. The post covers the technical integration and performance results of running this specialized biological ML model on Gaudi 2. This represents an intersection of domain-specific AI (protein modeling) and alternative AI hardware ecosystems.

Training Infrastructure Inference Economics ProtST Intel Gaudi Hugging Face +1 more

4Hugging Face Blog·1mo ago·source ↗

Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training

Habana Labs and Hugging Face announced a partnership to accelerate transformer model training on Habana's Gaudi AI processors. The collaboration aims to integrate Hugging Face's Transformers library with Habana's hardware, offering an alternative to GPU-based training infrastructure. This represents an early effort to diversify the AI training hardware ecosystem beyond NVIDIA dominance.

Training Infrastructure Inference Economics Habana Labs Gaudi Hugging Face Transformers +2 more

4Hugging Face Blog·1mo ago·source ↗

Accelerating LLM Inference with TGI on Intel Gaudi

Hugging Face's Text Generation Inference (TGI) framework has added a backend for Intel Gaudi accelerators, enabling LLM inference on Intel's AI hardware. The integration allows users to deploy large language models on Gaudi hardware using TGI's serving infrastructure. This expands the hardware ecosystem for LLM inference beyond NVIDIA GPUs, offering an alternative accelerator option for enterprise deployments.

Training Infrastructure Inference Economics Text Generation Inference Intel Gaudi Hugging Face +2 more

4Hugging Face Blog·1mo ago·source ↗

Faster Training and Inference: Habana Gaudi®2 vs Nvidia A100 80GB

Hugging Face published a benchmark comparison between Intel Habana Gaudi 2 and Nvidia A100 80GB GPUs for training and inference workloads. The post evaluates performance across common ML tasks to assess Gaudi 2 as an alternative accelerator. This is relevant to the broader question of GPU alternatives and inference economics in AI infrastructure.

Training Infrastructure Inference Economics Habana Gaudi Hugging Face Intel +1 more

4Hugging Face Blog·1mo ago·source ↗

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

Hugging Face published a blog post detailing how to run text-generation pipelines on Intel's Gaudi 2 AI accelerator. The post covers integration between Hugging Face's text-generation tooling and Intel's Gaudi 2 hardware, positioning it as an alternative inference accelerator to NVIDIA GPUs. This is relevant to the growing ecosystem of non-NVIDIA AI inference hardware.

Training Infrastructure Inference Economics Intel Gaudi Hugging Face Transformers Hugging Face +1 more

3Hugging Face Blog·1mo ago·source ↗

Pre-Train BERT with Hugging Face Transformers and Habana Gaudi

This Hugging Face blog post from August 2022 describes how to pre-train a BERT model from scratch using the Hugging Face Transformers library on Habana Gaudi hardware accelerators. It covers the full pipeline including data preparation, tokenizer training, and masked language modeling pretraining. The post serves as both a technical tutorial and a demonstration of Habana Gaudi's viability as an alternative AI training accelerator.

Training Infrastructure Habana Gaudi Hugging Face Transformers Hugging Face +2 more