product
AutoGPTQ
productactive
autogptq-7c4d5e55·1 events·first seen 28d agoAliases: AutoGPTQ
Co-occurring entities
More like this (12)
Recent events (1)
Making LLMs lighter with AutoGPTQ and transformers
Hugging Face announces native integration of AutoGPTQ into the transformers library, enabling 4-bit quantized inference for large language models. The integration allows users to load and run GPTQ-quantized models directly through the standard transformers API with minimal code changes. This lowers the hardware barrier for deploying LLMs by significantly reducing VRAM requirements while maintaining competitive performance.