Almanac
product

PEFT

productactivepeft-bae1d3ca·7 events·first seen 28d ago

Aliases: PEFT

Co-occurring entities

More like this (12)

Recent events (7)

6Hugging Face Blog·28d ago·source ↗

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Hugging Face introduces the PEFT library, which enables parameter-efficient fine-tuning of large language models using techniques such as LoRA, prefix tuning, and prompt tuning. The library allows practitioners to adapt large pretrained models to downstream tasks while updating only a small fraction of model parameters, dramatically reducing compute and memory requirements. This lowers the barrier to fine-tuning frontier-scale models on consumer hardware.

5Hugging Face Blog·28d ago·source ↗

🤗 PEFT Welcomes New Merging Methods

Hugging Face's PEFT library has added new methods for merging parameter-efficient fine-tuned adapters (e.g., LoRA). The update enables combining multiple fine-tuned adapters into a single model, expanding the toolkit for practitioners working with adapter-based fine-tuning. This is a tooling update relevant to the growing ecosystem of efficient fine-tuning and model composition workflows.

6Hugging Face Blog·28d ago·source ↗

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Hugging Face demonstrates a method for running RLHF fine-tuning on 20-billion-parameter language models using a single 24GB consumer GPU by combining TRL and PEFT (parameter-efficient fine-tuning). The approach uses techniques like LoRA and quantization to dramatically reduce memory requirements. This lowers the hardware barrier for RLHF experimentation from multi-GPU server setups to consumer-grade hardware.

4Hugging Face Blog·28d ago·source ↗

Fast LoRA inference for Flux with Diffusers and PEFT

Hugging Face published a technical blog post detailing optimizations for LoRA inference speed with the Flux image generation model using the Diffusers and PEFT libraries. The post covers techniques to accelerate adapter loading and inference throughput for diffusion models. This is relevant to practitioners deploying fine-tuned image generation models in production or research settings.

4Hugging Face Blog·28d ago·source ↗

Fine-Tuning Gemma Models in Hugging Face

Hugging Face published a guide on fine-tuning Google's Gemma models using parameter-efficient fine-tuning (PEFT) techniques. The post covers practical workflows for adapting Gemma to downstream tasks within the Hugging Face ecosystem. This represents part of the broader tooling support rollout accompanying Gemma's release in February 2024.

4Hugging Face Blog·28d ago·source ↗

Personal Copilot: Train Your Own Coding Assistant

This Hugging Face blog post walks through fine-tuning an open-weights code model to create a personalized coding assistant. It covers dataset preparation, training techniques (likely LoRA/PEFT), and deployment considerations for self-hosted code completion. The post targets practitioners who want a GitHub Copilot-like experience without relying on proprietary APIs.

6Hugging Face Blog·28d ago·source ↗

GaLore: Advancing Large Model Training on Consumer-grade Hardware

GaLore (Gradient Low-Rank Projection) is a memory-efficient training technique that reduces optimizer state memory by projecting gradients into a low-rank subspace during training, enabling large model training on consumer-grade hardware. The Hugging Face blog post covers integration of GaLore into the transformers and peft ecosystems. Unlike LoRA, GaLore applies low-rank projection to the full training process rather than constraining weight updates, allowing full-parameter learning with reduced memory footprint. This makes training models like LLaMA-7B feasible on single consumer GPUs.