Almanac
model

BLOOMZ

modelactivebloomz-327c621f·1 events·first seen 28d ago

Aliases: BLOOMZ

Co-occurring entities

More like this (12)

Recent events (1)

4Hugging Face Blog·28d ago·source ↗

Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator

This Hugging Face blog post covers deploying BLOOMZ, a large multilingual language model, on Intel's Habana Gaudi2 accelerator for inference. It benchmarks throughput and latency performance on Gaudi2 as an alternative to GPU-based inference. The post is part of ongoing work to demonstrate non-NVIDIA hardware options for large model deployment.