Entity · technique

NF4

techniqueactivenf4-fb82b186·1 events·first seen May 19, 2026

Aliases: NF4

Co-occurring entities

Hugging Face Transformers Hugging Face bitsandbytes GPTQ LLM.int8

More like this (12)

NF4 (NormalFloat4)NVFP4 NeRF NC-FFN NF-CoT FID Nx Inter4K FNet BFCLv3 GFNet BERT-F1

Recent events (1)

5Hugging Face Blog·May 19, 2026·source ↗

Overview of Natively Supported Quantization Schemes in 🤗 Transformers

This Hugging Face blog post surveys the quantization methods natively integrated into the Transformers library as of September 2023, covering schemes such as GPTQ, bitsandbytes (LLM.int8, NF4), and related techniques. It explains how each method works, their trade-offs in terms of memory reduction and inference speed, and how practitioners can apply them via the Transformers API. The post serves as a practical reference for deploying large language models under memory constraints.

Open Weights Progress Inference Economics NF4 Hugging Face Transformers Hugging Face +4 more