model
BLOOMZ
modelactive
bloomz-327c621f·1 events·first seen 28d agoAliases: BLOOMZ
Co-occurring entities
More like this (12)
Recent events (1)
Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
This Hugging Face blog post covers deploying BLOOMZ, a large multilingual language model, on Intel's Habana Gaudi2 accelerator for inference. It benchmarks throughput and latency performance on Gaudi2 as an alternative to GPU-based inference. The post is part of ongoing work to demonstrate non-NVIDIA hardware options for large model deployment.