5Mistral AI News·20d ago

Mistral Launches Batch API at 50% Cost Reduction

Mistral AI has released a batch API on La Plateforme that processes high-volume asynchronous requests at 50% lower cost than synchronous API calls. Users upload a batch file, wait for processing, then download results. The API supports all models on La Plateforme with a limit of 1 million ongoing requests per workspace, and is positioned as a response to recent API price hikes from competitors.

Inference Economics Enterprise Deployment Patterns Mistral Batch API Mistral AI La Plateforme

Related guides (3)

Mistral AI

Mistral AI: Europe's Open-Weights AI Challenger

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

7Mistral Ai News·20d ago·source ↗

Mistral Medium 3: Frontier-Class Performance at 8x Lower Cost

Mistral AI has released Mistral Medium 3, a new enterprise-focused language model priced at $0.4/$2 per million input/output tokens. The model claims to achieve 90%+ of Claude Sonnet 3.7's benchmark performance while undercutting cost leaders like DeepSeek v3, and outperforming open models including Llama 4 Maverick. It supports hybrid, on-premises, and in-VPC deployment on as few as four GPUs, and is available immediately on Mistral La Plateforme and Amazon SageMaker, with additional cloud platforms coming soon. The announcement also teases an upcoming large open-weights model release.

Frontier Model Releases Open Weights Progress Mistral AI Amazon SageMaker DeepSeek V4 +11 more

7Mistral Ai News·20d ago·source ↗

Mistral AI Releases Mistral Small v24.09, Free API Tier, and Pixtral 12B Vision on le Chat with Broad Price Cuts

Mistral AI announced a multi-part release on September 17, 2024: a free tier for la Plateforme API, significant price reductions across its model family (up to 80% for Mistral Small and Codestral), an updated Mistral Small v24.09 (22B parameters, improved alignment and reasoning), and the availability of Pixtral 12B vision capabilities on le Chat. Pixtral 12B, released under Apache 2.0, supports images of any size without text performance degradation and is now accessible for free on le Chat. The pricing updates also apply to cloud partner deployments on Azure AI Studio, Amazon Bedrock, and Google Vertex AI.

Frontier Model Releases Open Weights Progress Mistral AI Amazon Bedrock Apache 2.0 +14 more

7Mistral Ai News·1mo ago·source ↗

Mistral AI Launches La Plateforme: First API Endpoints in Early Access

Mistral AI opened beta access to its first developer platform, La Plateforme, offering three generative text endpoints (mistral-tiny, mistral-small, mistral-medium) and an embedding endpoint. Mistral-tiny serves Mistral 7B Instruct v0.2, mistral-small serves Mixtral 8x7B, and mistral-medium serves an unreleased prototype model scoring 8.6 on MT-Bench. The platform also introduces Mistral-embed with a 1024-dimension embedding model achieving 55.26 on MTEB. The API follows OpenAI-compatible chat interface specifications and is ramping toward general availability.

Frontier Model Releases Open Weights Progress MTEB Mistral AI MT-Bench +11 more

6Mistral Ai News·1mo ago·source ↗

Mistral AI Launches Workflows: Enterprise AI Orchestration Layer in Public Preview

Mistral AI has released Workflows in public preview, an enterprise-grade orchestration layer integrated into its Studio platform that enables durable, observable, fault-tolerant AI pipeline execution in production. The system supports human-in-the-loop approvals via a single API call, full execution tracing with OpenTelemetry, and Python-based workflow authoring that publishes to Le Chat for non-developer triggering. Early enterprise customers including ASML, ABANCA, CMA-CGM, and La Banque Postale are already using it for cargo release automation, KYC compliance, and customer support triage. The product targets the gap between proof-of-concept AI pipelines and reliable production deployment.

Inference Economics Enterprise Deployment Patterns Mistral AI Mistral Workflows OpenTelemetry +6 more

6Mistral Ai News·1mo ago·source ↗

Mistral OCR 3: New Frontier in Document Processing Accuracy and Efficiency

Mistral AI has released Mistral OCR 3 (model ID: mistral-ocr-2512), claiming a 74% overall win rate over its predecessor Mistral OCR 2 across forms, scanned documents, complex tables, and handwriting. The model supports markdown output with HTML-based table reconstruction and is priced at $2 per 1,000 pages ($1 with Batch API). It now powers the Document AI Playground in Mistral AI Studio, offering a drag-and-drop interface for parsing PDFs and images into text or structured JSON.

Inference Economics Enterprise Deployment Patterns Mistral AI Document AI Playground Mistral Studio +2 more

7Mistral Ai News·20d ago·source ↗

Mistral OCR: New Document Understanding API with State-of-the-Art Benchmark Performance

Mistral AI has released Mistral OCR, an Optical Character Recognition API designed for deep document understanding, handling text, tables, equations, images, and complex layouts from PDFs and images. The model claims top benchmark scores across math, multilingual, scanned, and table categories, outperforming Google Document AI, Azure OCR, Gemini 1.5/2.0, and GPT-4o on an internal test set. It is priced at 1000 pages per dollar (with batch inference doubling that), available via la Plateforme API today, and is already deployed as the default document understanding model in Le Chat. A selective self-hosting option is offered for organizations with sensitive data requirements.

Inference Economics Enterprise Deployment Patterns Mistral AI Azure OCR Gemini 1.5 Pro +8 more

7Mistral Ai News·1mo ago·source ↗

Mistral AI Launches Agents API with Built-in Connectors, MCP Tools, and Persistent Memory

Mistral AI has released a dedicated Agents API that extends beyond chat completion by providing built-in connectors for code execution, web search, image generation, and document retrieval, alongside support for Model Context Protocol (MCP) tools. The API features stateful conversation management with branching, streaming output, and multi-agent orchestration capabilities. Benchmark results show substantial web search augmentation gains: Mistral Large jumps from 23% to 75% on SimpleQA, and Mistral Medium from 22% to 82% with search enabled. The release targets enterprise-grade agentic workflows and is accompanied by cookbooks covering GitHub coding assistants, financial analysis, and travel planning use cases.

Frontier Model Releases Inference Economics Mistral AI GitHub Devstral 2 +9 more

8Mistral Ai News·1mo ago·source ↗

Mistral Launches Medium 3.5 (128B Open Weights), Remote Cloud Coding Agents in Vibe, and Work Mode in Le Chat

Mistral AI has released Mistral Medium 3.5, a 128B dense open-weights model with a 256k context window, configurable reasoning effort, and a vision encoder trained from scratch, scoring 77.6% on SWE-Bench Verified. Alongside the model, Mistral is launching remote cloud-based coding agents in its Vibe CLI and Le Chat interface, enabling async parallel coding sessions that run independently and notify users on completion. A new Work mode in Le Chat provides a multi-step agentic interface for cross-tool workflows including email, calendar, research, and issue tracking. Mistral Medium 3.5 replaces Devstral 2 as the default model in both Le Chat and the Vibe CLI, and is available for self-hosting on as few as four GPUs under a modified MIT license.

Long Context Evolution Frontier Model Releases Mistral AI Qwen3.5 397B A17B Devstral 2 +10 more