Unsloth
unsloth-f36698fa·4 events·first seen 1mo agoAliases: Unsloth
Co-occurring entities
More like this (12)
Recent events (4)
Unsloth: Web UI and Library for Efficient Fine-tuning of Open Models
Unsloth is an open-source Python library and web UI (Unsloth Studio) for efficient fine-tuning and local inference of open-weight models including Gemma 4, Qwen3, DeepSeek, and GPT-OSS variants. The project has accumulated over 64,000 GitHub stars with continued daily growth (+139 today), indicating strong community adoption. It targets practitioners who want to train and run large models locally with reduced memory and compute requirements.
Make LLM Fine-tuning 2x faster with Unsloth and 🤗 TRL
Hugging Face published a blog post detailing an integration between Unsloth and TRL (Transformer Reinforcement Learning) library that claims to achieve 2x faster LLM fine-tuning. The post covers how Unsloth optimizes training kernels to reduce memory usage and increase throughput. This is relevant to practitioners looking to reduce compute costs and time for fine-tuning large language models.
Train AI Models with Unsloth and Hugging Face Jobs for Free
Hugging Face has published a blog post describing how to use Unsloth in combination with Hugging Face Jobs to fine-tune AI models at no cost. The post targets practitioners looking for accessible, low-cost training workflows. It highlights the integration between Unsloth's memory-efficient training optimizations and Hugging Face's job execution infrastructure.
torchtune: PyTorch Native Post-Training Library for LLMs
Meta's PyTorch team introduces torchtune, a PyTorch-native library for post-training LLMs that emphasizes modularity, hackability, and direct access to underlying PyTorch components. The library supports fine-tuning, experimentation, and deployment-oriented workflows across distributed training settings. Benchmarked against popular frameworks Axolotl and Unsloth, torchtune demonstrates competitive performance and memory efficiency while maintaining flexibility for research iteration. The paper presents design principles, model builders, training recipes, and distributed training stack details.