Almanac
product

AutoGPTQ

product · active · autogptq-7c4d5e55 · 1 event · first seen 28d ago

Aliases: AutoGPTQ

Recent events (1)

6 · Hugging Face Blog · 28d ago

Making LLMs lighter with AutoGPTQ and transformers

Hugging Face announces native integration of AutoGPTQ into the transformers library, enabling 4-bit quantized inference for large language models. The integration allows users to load and run GPTQ-quantized models directly through the standard transformers API with minimal code changes. This lowers the hardware barrier for deploying LLMs by significantly reducing VRAM requirements while maintaining competitive performance.
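
A minimal sketch of what this looks like in practice, under stated assumptions. The checkpoint name `TheBloke/Llama-2-7b-Chat-GPTQ` is only an illustrative GPTQ repository on the Hub, and the helpers `weight_memory_gib` and `generate` are hypothetical names introduced here. Running `generate` is assumed to require a CUDA GPU plus the `optimum`, `auto-gptq`, and `accelerate` packages alongside `transformers`. The memory estimate counts weights only and ignores quantization scales and zero-points, activations, and the KV cache, so real VRAM use will be higher.

```python
# Illustrative checkpoint (an assumption); any GPTQ-quantized repo should work the same way.
MODEL_ID = "TheBloke/Llama-2-7b-Chat-GPTQ"


def weight_memory_gib(n_params: float, bits_per_weight: int) -> float:
    """Rough size of the weights alone, in GiB, at a given bit width."""
    return n_params * bits_per_weight / 8 / 2**30


def generate(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 32) -> str:
    """Load a GPTQ-quantized checkpoint through the standard API and generate text."""
    # Imported lazily so the estimate above can be used without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # No GPTQ-specific arguments here: the quantization settings are expected
    # to be read from the checkpoint's own config.
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For a 7-billion-parameter model, `weight_memory_gib(7e9, 16)` gives about 13.04 GiB while `weight_memory_gib(7e9, 4)` gives about 3.26 GiB, a fourfold reduction in weight storage. Calling `generate("Quantization lets large models")` would download the checkpoint and run it, given the dependencies assumed above.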