Entity · model

deepseek-coder

modelactivedeepseek-coder-114b17aa·5 events·first seen May 18, 2026

Aliases: deepseek-coder, DeepSeek Coder, DeepSeek Coder 33B, DeepSeek-Coder-1.3B

Co-occurring entities

More like this (12)

DeepSeek-Coder-V2-0724 DeepSeek Coder V2 lite DeepSeek-Coder-V1-6.7B DeepSeek API DeepSeek-V3.1-Base DeepSeek V4 DeepSeek-V2.5-1210 DeepSeek-V3-0324 DeepSeek-V3.2-Speciale DeepSeek-V4-Flash DeepSeek-R1-0528 DeepSeek-Prover-V2-7B

Recent events (5)

5arXiv · cs.CL·Jun 16, 2026·source ↗

Post-hoc falsification operators for frozen small code models fail to beat Best-of-N in leakage-free evaluation

A measurement study evaluates 26 post-hoc operators (selection, verification, repair, elimination, portfolios) applied to frozen small code models (≤1.5B parameters) against a Best-of-N baseline under a strict leakage-free, matched-compute protocol. None of the semantic operators improves held-out accuracy over BoN, with the failure traced to three structural mechanisms: a coverage wall, a capability scissors, and a near-empty consensus trap. Two non-semantic operators do provide value: an expression-layer recovery method (M1) lifts DeepSeek-Coder-1.3B by +12 tasks on HumanEval+ (p=2.4e-4), and an adaptive consensus early-stop saves ~19% compute with no accuracy harm. The paper's core lesson is that harness quality and coverage measurement should precede investment in semantic post-hoc reasoning.

Evaluation and Benchmarking Inference Economics Selection Without Signal, Recovery Through Expression: A Measurement Study of Post-Hoc Falsification Operators for Frozen Small Code Models deepseek-coder Best-of-N +2 more

7Mistral Ai News·Jun 1, 2026·source ↗

Mistral AI Releases Codestral: 22B Open-Weight Code Generation Model

Mistral AI has released Codestral, a 22B open-weight model explicitly designed for code generation, supporting 80+ programming languages with a 32k context window. The model is available under a non-production license on HuggingFace, with commercial licenses available on request, and is accessible via a dedicated API endpoint (codestral.mistral.ai) free during an 8-week beta. Codestral claims state-of-the-art performance on RepoBench, HumanEval, and fill-in-the-middle benchmarks, outperforming DeepSeek Coder 33B and matching or exceeding GPT-4-Turbo on some language-specific evals. Integrations are available with LlamaIndex, LangChain, Continue.dev, and Tabnine for IDE-based developer workflows.

Frontier Model Releases Evaluation and Benchmarking Mistral AI LlamaIndex GPT-4 Turbo +17 more

7Mistral Ai News·Jun 1, 2026·source ↗

Codestral 25.01: Mistral AI Releases Updated Coding Model with 2x Speed and Improved FIM Performance

Mistral AI has released Codestral 25.01, a significant upgrade to its Codestral coding model featuring a more efficient architecture and improved tokenizer that generates code approximately 2x faster than its predecessor. The model claims state-of-the-art performance for fill-in-the-middle (FIM) tasks across sub-100B parameter models, with a 256k context window and support for 80+ programming languages. Benchmarks show improvements over Codestral 2405 and competitive or superior results against DeepSeek Coder V2 lite and DeepSeek Coder 33B on HumanEval and FIM metrics. The model is available via Mistral's API, IDE plugins (VS Code, JetBrains via Continue), and for on-premises/VPC deployment, with cloud availability on Vertex AI and Azure AI Foundry.

Frontier Model Releases Evaluation and Benchmarking Mistral AI HumanEvalFIM Azure Foundry +12 more

6Hugging Face Blog·May 19, 2026·source ↗

CodeGemma - Google's Official Code-Focused LLM Release

Google has released CodeGemma, a family of code-specialized large language models, announced via the Hugging Face blog. CodeGemma builds on the Gemma model family and is targeted at code generation and understanding tasks. The release represents Google's continued push into open-weights code LLMs to compete with models like Code Llama and DeepSeek Coder.

Frontier Model Releases Open Weights Progress Gemma Code Llama Google +4 more

6Deepseek News·May 18, 2026·source ↗

DeepSeek API Major Upgrade: Function Calling, FIM, Chat Prefix Completion, JSON Output, and 8K Token Limit

DeepSeek has released a significant API update adding Function Calling (up to 128 parallel calls, OpenAI-compatible), JSON Output, Chat Prefix Completion, and FIM (Fill-In-the-Middle) Completion to both deepseek-chat and deepseek-coder models. The update also raises the max_tokens ceiling to 8K in the Beta API. Several features are in Beta and will be open-sourced once stable. The Function Calling and JSON Output implementations are explicitly designed to be compatible with the OpenAI API.

Open Weights Progress Inference Economics DeepSeek V4 FIM Completion deepseek-chat +5 more