Hugging Face Optimum
hugging-face-optimum-a558f202·2 events·first seen 28d agoAliases: Hugging Face Optimum
Co-occurring entities
More like this (12)
Recent events (2)
Convert Transformers to ONNX with Hugging Face Optimum
Hugging Face published a guide on converting Transformer models to ONNX format using the Optimum library. The post covers the tooling workflow for exporting models from the Transformers ecosystem into ONNX for optimized inference deployment. This is a practical infrastructure topic relevant to production ML deployment patterns.
Optimizing Stable Diffusion for Intel CPUs with NNCF and Hugging Face Optimum
This Hugging Face blog post details techniques for optimizing Stable Diffusion inference on Intel CPUs using Neural Network Compression Framework (NNCF) and the Optimum library. The workflow covers quantization and other compression methods to reduce latency and memory footprint on CPU hardware. This is relevant to the inference-economics and enterprise-deployment threads as it addresses running diffusion models without dedicated GPU hardware.