7 · Hugging Face Blog · 1mo ago

StarCoder2 and The Stack v2

Hugging Face and BigCode released StarCoder2, a new family of open code language models trained on The Stack v2, a significantly expanded, permissively licensed code dataset. The release includes multiple model sizes and is a major update to the BigCode open-weights code model lineage.

Training Infrastructure · Open Weights Progress · Agent and Tool Ecosystem · BigCode · The Stack v2 · Hugging Face · StarCoder2

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Open Weights Progress · Topic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Agent and Tool Ecosystem · Topic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Training Infrastructure · Topic guide

Training Infrastructure: The Compute Arms Race Powering Modern AI

Related events (8)

6 · Hugging Face Blog · 1mo ago

StarCoder: A State-of-the-Art LLM for Code

Hugging Face and ServiceNow released StarCoder, a large language model for code trained on permissively licensed data from The Stack dataset. The model targets code generation, completion, and understanding tasks and is positioned as an open-weights alternative to proprietary code models. The release includes model weights, training details, and an associated technical report.

Open Weights Progress · Agent and Tool Ecosystem · ServiceNow AI · BigCode · The Stack v2

5 · Hugging Face Blog · 1mo ago

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Hugging Face introduces StarCoder2-Instruct, a code generation model fine-tuned via a self-alignment approach that requires no human-annotated instruction data. The method uses the base model itself to generate synthetic instruction-response pairs, which are then filtered and used for supervised fine-tuning. The model and all training data, pipelines, and evaluation code are released under permissive licenses, making it one of the more transparent instruction-tuned code models available.

Open Weights Progress · Agent and Tool Ecosystem · BigCode · StarCoder2-Instruct · Self-Instruct

5 · Hugging Face Blog · 1mo ago

Creating a Coding Assistant with StarCoder

This Hugging Face blog post describes the process of building StarChat-Alpha, a conversational coding assistant fine-tuned from the StarCoder large language model. The post covers the instruction-tuning methodology used to adapt StarCoder for chat-style interactions, including dataset preparation and training details. It represents an early example of open-weights coding LLMs being adapted into assistant-style deployments.

Open Weights Progress · Agent and Tool Ecosystem · BigCode · Hugging Face · StarCoder2

7 · Mistral AI News · 19d ago

Mistral AI Releases Codestral: 22B Open-Weight Code Generation Model

Mistral AI has released Codestral, a 22B open-weight model explicitly designed for code generation, supporting 80+ programming languages with a 32k context window. The model is available under a non-production license on Hugging Face, with commercial licenses available on request, and is accessible via a dedicated API endpoint (codestral.mistral.ai) that is free during an 8-week beta. Codestral claims state-of-the-art performance on RepoBench, HumanEval, and fill-in-the-middle benchmarks, outperforming DeepSeek Coder 33B and matching or exceeding GPT-4-Turbo on some language-specific evals. Integrations are available with LlamaIndex, LangChain, Continue.dev, and Tabnine for IDE-based developer workflows.

Frontier Model Releases · Evaluation and Benchmarking · Mistral AI · LlamaIndex · GPT-4 Turbo

7 · Hugging Face Blog · 1mo ago

Code Llama: Llama 2 learns to code

Meta released Code Llama, a family of code-specialized large language models built on top of Llama 2. The models are available in multiple sizes and variants, including a Python-specialized version and an instruction-following version. Code Llama supports long context windows for handling large codebases and is released as open weights, making it accessible for research and commercial use.

Long Context Evolution · Frontier Model Releases · Code Llama · Llama 2 · Hugging Face

7 · Mistral AI News · 19d ago

Mistral Announces Codestral 25.08 and Integrated Enterprise Coding Stack

Mistral AI has released Codestral 25.08, a code generation model update claiming +30% accepted completions, 50% fewer runaway generations, and improved FIM benchmark performance. The announcement also frames a full enterprise coding stack comprising Codestral (completion), Codestral Embed (code-specific retrieval), and Devstral (agentic workflows via OpenHands), all deployable on-prem or in VPC environments. Devstral Medium is reported to achieve 61.6% on SWE-Bench Verified, while Devstral Small (24B, Apache-2.0) reaches 53.6%. The pitch targets regulated industries blocked by SaaS-only competitors through self-hostable, air-gapped deployment options.

Frontier Model Releases · Evaluation and Benchmarking · Devstral 2 Small · Fill-in-the-Middle (FIM) · Mistral AI

8 · Hugging Face Blog · 1mo ago

Llama 2 is here - get it on Hugging Face

Meta released Llama 2, a new family of open-weights large language models, made available through Hugging Face. The release includes both base and fine-tuned chat variants across multiple parameter sizes. This represents a significant expansion of accessible open-weights frontier models, with Meta and Microsoft partnering on distribution.

Frontier Model Releases · Open Weights Progress · Microsoft · Llama 2 · Hugging Face

6 · Hugging Face Blog · 1mo ago

CodeGemma - Google's Official Code-Focused LLM Release

Google has released CodeGemma, a family of code-specialized large language models, announced via the Hugging Face blog. CodeGemma builds on the Gemma model family and is targeted at code generation and understanding tasks. The release represents Google's continued push into open-weights code LLMs to compete with models like Code Llama and DeepSeek Coder.

Frontier Model Releases · Open Weights Progress · Gemma · Code Llama · Google