Entity · technique

ONNX

technique · active · onnx-b402285f · 6 events · first seen May 19, 2026

Aliases: ONNX, ONNX Runtime

Merged from

ONNX Runtime

Co-occurring entities

Hugging Face Optimum · Microsoft · Transformers Pipelines · Transformers · Hugging Face · Optimum · Stable Diffusion Turbo · SDXL Turbo · Olive · Intel Xeon · SetFit · Optimum-Intel

More like this (12)

OpenVINO · Nx · OpenML · OpenVINO GenAI · OpenAI Nonprofit · OpenAI, Inc. · Optimum-NVIDIA · NextGenAI · MLX · OpenAI Nonprofit Commission · OpenAI Foundation · OpenAI o1-preview

Recent events (6)

4 · Hugging Face Blog · May 19, 2026

Accelerated Inference with Optimum and Transformers Pipelines

Hugging Face announced an integration between the Optimum library and the Transformers Pipelines API, enabling hardware-accelerated inference with minimal code changes. The integration targets accelerated inference runtimes such as ONNX Runtime, so users can swap in an optimized inference engine without rewriting their pipeline code. This lowers the barrier to production-grade inference optimization for practitioners using the Hugging Face ecosystem.

Inference Economics · Agent and Tool Ecosystem · Optimum · ONNX · Transformers Pipelines · +1 more
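The swap described in this event can be sketched roughly as follows. This is a minimal illustration, not the code from the blog post: it assumes the `optimum[onnxruntime]` and `transformers` packages are installed, and the helper name `build_ort_classifier` and the default checkpoint are chosen here purely for illustration.

```python
def build_ort_classifier(model_id="distilbert-base-uncased-finetuned-sst-2-english"):
    """Return a Transformers text-classification pipeline backed by ONNX Runtime.

    The helper name and the default checkpoint are illustrative assumptions.
    """
    # Imported lazily so this module still loads where the optional
    # packages (optimum, transformers) are not installed.
    from optimum.onnxruntime import ORTModelForSequenceClassification
    from transformers import AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # export=True converts the PyTorch checkpoint to ONNX while loading.
    model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
    # The ORT model is passed where a regular Transformers model would go,
    # so the calling code keeps using the familiar pipeline interface.
    return pipeline("text-classification", model=model, tokenizer=tokenizer)
```

Calling `build_ort_classifier()("ONNX Runtime made this faster.")` would download the checkpoint on first use, so it needs network access; the returned object is used like any other Transformers pipeline.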

4 · Hugging Face Blog · May 19, 2026

Convert Transformers to ONNX with Hugging Face Optimum

Hugging Face published a guide on converting Transformer models to ONNX format using the Optimum library. The post covers the tooling workflow for exporting models from the Transformers ecosystem into ONNX for optimized inference deployment. The guide is practical infrastructure material for teams deploying Transformers models in production.

Inference Economics · Enterprise Deployment Patterns · Transformers · ONNX · Hugging Face · +1 more
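One way the export workflow could look is sketched below. It is an assumption-laden illustration rather than the guide's own code: it presumes `optimum[onnxruntime]` and `transformers` are installed, targets a sequence-classification checkpoint, and the function name `export_to_onnx` is hypothetical.

```python
from pathlib import Path


def export_to_onnx(model_id, output_dir):
    """Export a Hub sequence-classification checkpoint to ONNX files on disk.

    Hypothetical helper for illustration; returns the exported .onnx file names.
    """
    # Lazy imports: the optional packages are only needed when exporting.
    from optimum.onnxruntime import ORTModelForSequenceClassification
    from transformers import AutoTokenizer

    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)

    # export=True runs the PyTorch-to-ONNX conversion during loading.
    model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
    model.save_pretrained(out)
    # Save the tokenizer next to the model so the directory is self-contained.
    AutoTokenizer.from_pretrained(model_id).save_pretrained(out)

    return sorted(p.name for p in out.glob("*.onnx"))
```

The saved directory can later be reloaded with `ORTModelForSequenceClassification.from_pretrained(output_dir)` without repeating the conversion.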

4 · Hugging Face Blog · May 19, 2026

Optimum + ONNX Runtime: Faster Training for Hugging Face Models

Hugging Face's Optimum library integrates with Microsoft's ONNX Runtime Training to accelerate fine-tuning of transformer models. The integration aims to reduce training time and memory usage with minimal code changes for practitioners using the Hugging Face ecosystem. This tooling update targets enterprise and research users looking to optimize training efficiency on existing hardware.

Training Infrastructure · Agent and Tool Ecosystem · Optimum · Microsoft · ONNX · +1 more

5 · Hugging Face Blog · May 19, 2026

Accelerating over 130,000 Hugging Face Models with ONNX Runtime

Hugging Face and Microsoft have integrated ONNX Runtime (ORT) to accelerate inference for over 130,000 models on the Hugging Face Hub. The integration enables optimized deployment across CPU and GPU hardware without requiring users to manually export or configure ONNX models. This represents a significant expansion of ORT's reach within the open-weights model ecosystem, lowering the barrier to production-grade inference optimization.

Open Weights Progress · Inference Economics · Optimum · Microsoft · ONNX · +2 more

4 · Hugging Face Blog · May 19, 2026

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

This Hugging Face blog post details how to accelerate Stable Diffusion Turbo and SDXL Turbo inference using ONNX Runtime and Microsoft's Olive optimization toolkit. The post covers the workflow for converting and optimizing diffusion models for faster deployment. This is a practical inference optimization guide targeting practitioners deploying image generation models.

Inference Economics · Agent and Tool Ecosystem · Stable Diffusion Turbo · SDXL Turbo · Microsoft · +3 more

4 · Hugging Face Blog · May 19, 2026

Blazing Fast SetFit Inference with Optimum Intel on Xeon

Hugging Face demonstrates accelerated inference for SetFit few-shot text classification models using Optimum Intel on Intel Xeon CPUs. The post covers optimization techniques such as quantization and ONNX export to improve throughput and latency for CPU-based deployment. This is relevant to practitioners deploying lightweight NLP models in cost-sensitive or edge environments without GPU hardware.

Inference Economics · Enterprise Deployment Patterns · ONNX · Intel Xeon · SetFit · +2 more
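The quantization mentioned in this event can be illustrated with a small, dependency-free sketch of symmetric per-tensor int8 quantization. This shows only the underlying arithmetic; it is not the Optimum Intel tooling used in the post, and the function names and sample weights are made up for illustration.

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization: returns (integer codes, scale)."""
    max_abs = max(abs(v) for v in values)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero input.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the symmetric int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


# Illustrative weights only; real models quantize whole weight tensors.
weights = [0.5, -1.27, 0.0, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)
# Rounding to the nearest code bounds the error by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Storing 8-bit codes plus one scale instead of 32-bit floats cuts weight memory roughly fourfold, which is one reason quantization helps throughput on CPU-only deployments; the accuracy cost depends on the model and has to be measured.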