
Mistral-medium
mistral-medium-b9b8839b·7 events·first seen 1mo agoAliases: Mistral-medium, Mistral Medium 3.5, Mistral Medium 3.1, Mistral Medium, Mistral Medium 3
Co-occurring entities
More like this (12)
Recent events (7)
Mistral Launches Medium 3.5 (128B Open Weights), Remote Cloud Coding Agents in Vibe, and Work Mode in Le Chat
Mistral AI has released Mistral Medium 3.5, a 128B dense open-weights model with a 256k context window, configurable reasoning effort, and a vision encoder trained from scratch, scoring 77.6% on SWE-Bench Verified. Alongside the model, Mistral is launching remote cloud-based coding agents in its Vibe CLI and Le Chat interface, enabling async parallel coding sessions that run independently and notify users on completion. A new Work mode in Le Chat provides a multi-step agentic interface for cross-tool workflows including email, calendar, research, and issue tracking. Mistral Medium 3.5 replaces Devstral 2 as the default model in both Le Chat and the Vibe CLI, and is available for self-hosting on as few as four GPUs under a modified MIT license.
Mistral Medium 3: Frontier-Class Performance at 8x Lower Cost
Mistral AI has released Mistral Medium 3, a new enterprise-focused language model priced at $0.4/$2 per million input/output tokens. The model claims to achieve 90%+ of Claude Sonnet 3.7's benchmark performance while undercutting cost leaders like DeepSeek v3, and outperforming open models including Llama 4 Maverick. It supports hybrid, on-premises, and in-VPC deployment on as few as four GPUs, and is available immediately on Mistral La Plateforme and Amazon SageMaker, with additional cloud platforms coming soon. The announcement also teases an upcoming large open-weights model release.
Mistral AI Launches Le Chat Enterprise with Mistral Medium 3
Mistral AI has introduced Le Chat Enterprise, a feature-rich AI assistant platform powered by the newly announced Mistral Medium 3 model, targeting enterprise AI adoption challenges such as tool fragmentation and insecure knowledge integration. The platform includes enterprise search with connectors to Google Drive, SharePoint, OneDrive, Gmail, and Google Calendar, plus agent builders, document libraries, custom model support, and hybrid deployment options. Le Chat Enterprise is available now on Google Cloud Marketplace, with Azure AI and AWS Marketplace listings forthcoming. Mistral also announced improvements to its Le Chat Pro and Team plans.
Mistral AI Launches La Plateforme: First API Endpoints in Early Access
Mistral AI opened beta access to its first developer platform, La Plateforme, offering three generative text endpoints (mistral-tiny, mistral-small, mistral-medium) and an embedding endpoint. Mistral-tiny serves Mistral 7B Instruct v0.2, mistral-small serves Mixtral 8x7B, and mistral-medium serves an unreleased prototype model scoring 8.6 on MT-Bench. The platform also introduces Mistral-embed with a 1024-dimension embedding model achieving 55.26 on MTEB. The API follows OpenAI-compatible chat interface specifications and is ramping toward general availability.
Mistral AI Engineering Deep Dive: Debugging a Memory Leak in vLLM
Mistral AI's engineering team investigated a memory leak in vLLM that appeared exclusively during disaggregated prefill/decode serving with Mistral Medium 3.1 and graph compilation enabled, causing ~400 MB/min RSS growth. The leak was not visible in heap profilers (Memray, Guppy3, Heaptrack), pointing to off-heap memory allocation tied to NIXL/UCX-based KV cache transfer over InfiniBand. The post is the first in a new Engineering Deep Dive series and documents a methodical descent from Python-level tools to kernel-level tracing to isolate the root cause.
Mistral AI Launches Agents API with Built-in Connectors, MCP Tools, and Persistent Memory
Mistral AI has released a dedicated Agents API that extends beyond chat completion by providing built-in connectors for code execution, web search, image generation, and document retrieval, alongside support for Model Context Protocol (MCP) tools. The API features stateful conversation management with branching, streaming output, and multi-agent orchestration capabilities. Benchmark results show substantial web search augmentation gains: Mistral Large jumps from 23% to 75% on SimpleQA, and Mistral Medium from 22% to 82% with search enabled. The release targets enterprise-grade agentic workflows and is accompanied by cookbooks covering GitHub coding assistants, financial analysis, and travel planning use cases.
Mistral AI Launches Mistral Code: Enterprise AI Coding Assistant with On-Prem Deployment
Mistral AI has announced Mistral Code, an enterprise-grade AI coding assistant currently in private beta for JetBrains IDEs and VSCode. The product bundles four specialized models (Codestral, Codestral Embed, Devstral, Mistral Medium) with an IDE plugin, admin controls, and deployment options ranging from serverless to air-gapped on-premises GPUs. It is built on a fork of the open-source Continue project with enterprise additions including RBAC, audit logging, and fine-tuning on private repositories. Early enterprise adopters include Abanca, SNCF (4,000 developers), and Capgemini (1,500+ developers).