Almanac
product

Intel Gaudi

productactiveintel-gaudi-5ee12ec1·5 events·first seen 28d ago

Aliases: Intel Gaudi, Intel Gaudi 2

Co-occurring entities

More like this (12)

Recent events (5)

4Hugging Face Blog·28d ago·source ↗

Faster Assisted Generation Support for Intel Gaudi

Hugging Face has published a blog post detailing assisted generation (speculative decoding) support optimized for Intel Gaudi accelerators. The post covers implementation details and performance improvements achieved by running assisted/speculative decoding on Gaudi hardware. This represents an infrastructure and inference optimization development relevant to non-NVIDIA AI accelerator deployment.

4Hugging Face Blog·28d ago·source ↗

Accelerating LLM Inference with TGI on Intel Gaudi

Hugging Face's Text Generation Inference (TGI) framework has added a backend for Intel Gaudi accelerators, enabling LLM inference on Intel's AI hardware. The integration allows users to deploy large language models on Gaudi hardware using TGI's serving infrastructure. This expands the hardware ecosystem for LLM inference beyond NVIDIA GPUs, offering an alternative accelerator option for enterprise deployments.

4Hugging Face Blog·28d ago·source ↗

Building Cost-Efficient Enterprise RAG Applications with Intel Gaudi 2 and Intel Xeon

This Hugging Face blog post details how to build retrieval-augmented generation (RAG) pipelines for enterprise use cases using Intel Gaudi 2 accelerators and Intel Xeon CPUs. It covers the architecture and cost-efficiency tradeoffs of deploying RAG on Intel hardware as an alternative to GPU-based infrastructure. The post is positioned as a practical guide for organizations seeking lower-cost inference deployments.

4Hugging Face Blog·28d ago·source ↗

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

Hugging Face published a blog post detailing how to run text-generation pipelines on Intel's Gaudi 2 AI accelerator. The post covers integration between Hugging Face's text-generation tooling and Intel's Gaudi 2 hardware, positioning it as an alternative inference accelerator to NVIDIA GPUs. This is relevant to the growing ecosystem of non-NVIDIA AI inference hardware.

3Hugging Face Blog·28d ago·source ↗

Accelerating Protein Language Model ProtST on Intel Gaudi 2

A Hugging Face blog post details the acceleration of ProtST, a protein language model, on Intel's Gaudi 2 AI accelerator hardware. The post covers the technical integration and performance results of running this specialized biological ML model on Gaudi 2. This represents an intersection of domain-specific AI (protein modeling) and alternative AI hardware ecosystems.