Almanac
technique

NF4 (NormalFloat4)

techniqueactivenf4-normalfloat4--71f4c6fc·1 events·first seen 28d ago

Aliases: NF4 (NormalFloat4)

Co-occurring entities

More like this (12)

Recent events (1)

6Hugging Face Blog·28d ago·source ↗

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Hugging Face published a blog post detailing the integration of 4-bit quantization via bitsandbytes into the Transformers library, enabling large language models to run on consumer-grade hardware. The post covers NF4 (NormalFloat4) data type and double quantization techniques from the QLoRA paper, which together reduce memory footprint significantly while preserving model quality. It demonstrates how users can load models like LLaMA in 4-bit precision and fine-tune them using QLoRA with minimal code changes.