Mistral Launches Batch API at 50% Cost Reduction
Mistral AI has released a batch API on La Plateforme that processes high-volume asynchronous requests at 50% lower cost than synchronous API calls. Users upload a batch file, wait for processing, then download results. The API supports all models on La Plateforme with a limit of 1 million ongoing requests per workspace, and is positioned as a response to recent API price hikes from competitors.
Related guides (3)
Related events (8)
Mistral Medium 3: Frontier-Class Performance at 8x Lower Cost
Mistral AI has released Mistral Medium 3, a new enterprise-focused language model priced at $0.4/$2 per million input/output tokens. The model claims to achieve 90%+ of Claude Sonnet 3.7's benchmark performance while undercutting cost leaders like DeepSeek v3, and outperforming open models including Llama 4 Maverick. It supports hybrid, on-premises, and in-VPC deployment on as few as four GPUs, and is available immediately on Mistral La Plateforme and Amazon SageMaker, with additional cloud platforms coming soon. The announcement also teases an upcoming large open-weights model release.
Mistral AI Releases Mistral Small v24.09, Free API Tier, and Pixtral 12B Vision on le Chat with Broad Price Cuts
Mistral AI announced a multi-part release on September 17, 2024: a free tier for la Plateforme API, significant price reductions across its model family (up to 80% for Mistral Small and Codestral), an updated Mistral Small v24.09 (22B parameters, improved alignment and reasoning), and the availability of Pixtral 12B vision capabilities on le Chat. Pixtral 12B, released under Apache 2.0, supports images of any size without text performance degradation and is now accessible for free on le Chat. The pricing updates also apply to cloud partner deployments on Azure AI Studio, Amazon Bedrock, and Google Vertex AI.
Mistral AI Launches La Plateforme: First API Endpoints in Early Access
Mistral AI opened beta access to its first developer platform, La Plateforme, offering three generative text endpoints (mistral-tiny, mistral-small, mistral-medium) and an embedding endpoint. Mistral-tiny serves Mistral 7B Instruct v0.2, mistral-small serves Mixtral 8x7B, and mistral-medium serves an unreleased prototype model scoring 8.6 on MT-Bench. The platform also introduces Mistral-embed with a 1024-dimension embedding model achieving 55.26 on MTEB. The API follows OpenAI-compatible chat interface specifications and is ramping toward general availability.
Mistral AI Launches Workflows: Enterprise AI Orchestration Layer in Public Preview
Mistral AI has released Workflows in public preview, an enterprise-grade orchestration layer integrated into its Studio platform that enables durable, observable, fault-tolerant AI pipeline execution in production. The system supports human-in-the-loop approvals via a single API call, full execution tracing with OpenTelemetry, and Python-based workflow authoring that publishes to Le Chat for non-developer triggering. Early enterprise customers including ASML, ABANCA, CMA-CGM, and La Banque Postale are already using it for cargo release automation, KYC compliance, and customer support triage. The product targets the gap between proof-of-concept AI pipelines and reliable production deployment.
Mistral OCR 3: New Frontier in Document Processing Accuracy and Efficiency
Mistral AI has released Mistral OCR 3 (model ID: mistral-ocr-2512), claiming a 74% overall win rate over its predecessor Mistral OCR 2 across forms, scanned documents, complex tables, and handwriting. The model supports markdown output with HTML-based table reconstruction and is priced at $2 per 1,000 pages ($1 with Batch API). It now powers the Document AI Playground in Mistral AI Studio, offering a drag-and-drop interface for parsing PDFs and images into text or structured JSON.
Mistral OCR: New Document Understanding API with State-of-the-Art Benchmark Performance
Mistral AI has released Mistral OCR, an Optical Character Recognition API designed for deep document understanding, handling text, tables, equations, images, and complex layouts from PDFs and images. The model claims top benchmark scores across math, multilingual, scanned, and table categories, outperforming Google Document AI, Azure OCR, Gemini 1.5/2.0, and GPT-4o on an internal test set. It is priced at 1000 pages per dollar (with batch inference doubling that), available via la Plateforme API today, and is already deployed as the default document understanding model in Le Chat. A selective self-hosting option is offered for organizations with sensitive data requirements.
Mistral AI Launches Agents API with Built-in Connectors, MCP Tools, and Persistent Memory
Mistral AI has released a dedicated Agents API that extends beyond chat completion by providing built-in connectors for code execution, web search, image generation, and document retrieval, alongside support for Model Context Protocol (MCP) tools. The API features stateful conversation management with branching, streaming output, and multi-agent orchestration capabilities. Benchmark results show substantial web search augmentation gains: Mistral Large jumps from 23% to 75% on SimpleQA, and Mistral Medium from 22% to 82% with search enabled. The release targets enterprise-grade agentic workflows and is accompanied by cookbooks covering GitHub coding assistants, financial analysis, and travel planning use cases.
Mistral Launches Medium 3.5 (128B Open Weights), Remote Cloud Coding Agents in Vibe, and Work Mode in Le Chat
Mistral AI has released Mistral Medium 3.5, a 128B dense open-weights model with a 256k context window, configurable reasoning effort, and a vision encoder trained from scratch, scoring 77.6% on SWE-Bench Verified. Alongside the model, Mistral is launching remote cloud-based coding agents in its Vibe CLI and Le Chat interface, enabling async parallel coding sessions that run independently and notify users on completion. A new Work mode in Le Chat provides a multi-step agentic interface for cross-tool workflows including email, calendar, research, and issue tracking. Mistral Medium 3.5 replaces Devstral 2 as the default model in both Le Chat and the Vibe CLI, and is available for self-hosting on as few as four GPUs under a modified MIT license.


