technique
DeepSpeed ZeRO
techniqueactive
deepspeed-zero-ef7565a4·1 events·first seen 28d agoAliases: DeepSpeed ZeRO
Co-occurring entities
More like this (12)
Recent events (1)
Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate
This Hugging Face blog post details inference optimization techniques for the BLOOM 176B parameter model using DeepSpeed ZeRO and Hugging Face Accelerate. The post provides PyTorch scripts and benchmarks demonstrating significant throughput improvements through tensor parallelism and other optimizations. It serves as a practical guide for deploying large open-weight models efficiently across multiple GPUs.