Almanac
model

Mistral Large 2

modelactivemistral-large-2-93661385·12 events·first seen 1mo ago

Aliases: Mistral Large 2, Mistral Large 3, Mistral Large, Mistral-Large

Co-occurring entities

More like this (12)

Recent events (12)

8Mistral Ai News·1mo ago·source ↗

Mistral Releases Mistral 3 Family: Mistral Large 3 (675B MoE) and Ministral 3 Series (3B–14B), All Apache 2.0

Mistral AI has announced Mistral 3, a family of open-weight models including Mistral Large 3 (41B active / 675B total sparse MoE) and three dense Ministral 3 edge models (3B, 8B, 14B), all released under Apache 2.0. Mistral Large 3 debuts at #2 on LMArena's OSS non-reasoning leaderboard, supports image understanding, and was trained on 3,000 NVIDIA H200 GPUs; a reasoning variant is forthcoming. The Ministral 3 series includes base, instruct, and reasoning variants with multimodal and multilingual capabilities, with the 14B reasoning model achieving 85% on AIME '25. The release involves deep co-optimization with NVIDIA (Blackwell/Hopper kernels, NVFP4 format), vLLM, and Red Hat, and is available across major cloud and inference platforms.

8Mistral Ai News·15d ago·source ↗

Mistral Large 2 (123B): New Frontier Model with 128k Context, Multilingual and Code Capabilities

Mistral AI releases Mistral Large 2, a 123-billion-parameter model with a 128k context window, supporting 80+ coding languages and over a dozen natural languages. The model claims competitive performance with GPT-4o, Claude 3 Opus, and Llama 3 405B on code generation, reasoning, and multilingual benchmarks, while targeting cost-efficient single-node inference. Weights are available under a Mistral Research License for non-commercial use, with a commercial license required for self-deployment. The model is accessible via Mistral's la Plateforme API (mistral-large-2407), HuggingFace, and Google Cloud Vertex AI.

8Mistral Ai News·15d ago·source ↗

Mistral AI Releases Mistral Large, Claims Second-Best API Model After GPT-4

Mistral AI has released Mistral Large, its most capable model to date, claiming second place among API-accessible models behind GPT-4 on standard benchmarks including MMLU, HellaSwag, and coding/math evals. The model features a 32K context window, native fluency in five European languages, function calling, and constrained output mode. Simultaneously, Mistral is launching a new Mistral Small optimized for latency, restructuring its endpoint lineup, and announcing Microsoft Azure as its first major distribution partner. This marks Mistral's first significant commercial partnership and expansion beyond its own infrastructure.

6Mistral Ai News·15d ago·source ↗

Mistral AI Publishes First Comprehensive Lifecycle Analysis of LLM Environmental Footprint

Mistral AI has released what it claims is the first comprehensive lifecycle analysis (LCA) of an AI model, conducted in collaboration with Carbone 4 and French agency ADEME, covering greenhouse gas emissions, water use, and resource depletion. Key findings include Mistral Large 2 generating 20.4 ktCO₂e, 281,000 m³ of water, and 660 kg Sb eq over 18 months of training and usage, with a single 400-token Le Chat inference costing 1.14 gCO₂e and 45 mL of water. The study proposes three standardized reporting indicators for the industry and advocates for mandatory disclosure of training and inference environmental impacts. Mistral argues model size correlates roughly linearly with environmental footprint, emphasizing the importance of right-sizing model selection.

3Mistral Ai News·15d ago·source ↗

Mistral AI Demonstrates Agentic Workflow for Meeting-to-Dev-Ticket Automation

Mistral AI has published a solution blog describing a multi-agent workflow called TranscriptToPRDTicket that converts meeting transcripts into Product Requirements Documents and engineering tickets using two specialized agents (PRDAgent and TicketCreationAgent) both powered by Mistral Large 2. The pipeline integrates with project management tools such as Linear and Jira, and a full implementation is provided via a Google Colab notebook. The post is primarily a deployment-pattern showcase rather than a new model or capability announcement.

6Mistral Ai News·15d ago·source ↗

Mistral AI Announces Fine-Tuning for Flagship Models, Agents Alpha, and SDK 1.0

Mistral AI has announced three platform updates: fine-tuning support for all flagship and specialist models on La Plateforme (including Mistral Large 2 and Codestral), an alpha release of an Agents feature enabling custom workflows via Le Chat or API, and a stable 1.0 release of the mistralai Python and TypeScript SDK. Fine-tuning supports base prompts, few-shot prompting, and full fine-tuning with custom datasets. The Agents feature is described as early-stage, with tool and data-source integrations planned.

7Mistral Ai News·1mo ago·source ↗

Pixtral Large: Mistral AI's 124B Open-Weights Multimodal Model

Mistral AI released Pixtral Large, a 124B open-weights multimodal model built on Mistral Large 2, featuring a 1B parameter vision encoder and 128K context window supporting at least 30 high-resolution images. The model claims state-of-the-art results on MathVista, DocVQA, and ChartQA, outperforming GPT-4o and Gemini-1.5 Pro on several benchmarks, and leads the LMSys Vision Leaderboard among open-weights models by ~50 ELO points. Simultaneously, Mistral updated its text model to Mistral Large 24.11 with improvements in long-context understanding, function calling, and RAG/agentic workflows. Note: the model has since been deprecated and replaced by newer Mistral vision models.

7Mistral Ai News·1mo ago·source ↗

Mistral AI Launches Agents API with Built-in Connectors, MCP Tools, and Persistent Memory

Mistral AI has released a dedicated Agents API that extends beyond chat completion by providing built-in connectors for code execution, web search, image generation, and document retrieval, alongside support for Model Context Protocol (MCP) tools. The API features stateful conversation management with branching, streaming output, and multi-agent orchestration capabilities. Benchmark results show substantial web search augmentation gains: Mistral Large jumps from 23% to 75% on SimpleQA, and Mistral Medium from 22% to 82% with search enabled. The release targets enterprise-grade agentic workflows and is accompanied by cookbooks covering GitHub coding assistants, financial analysis, and travel planning use cases.

6Mistral Ai News·15d ago·source ↗

Mistral AI Launches Le Chat Conversational Assistant

Mistral AI has released Le Chat, a multilingual conversational assistant built on its own models including Mistral Large, Mistral Small, and a new prototype called Mistral Next. The product serves as both a public-facing demo of Mistral's capabilities and a business offering via Le Chat Enterprise, which includes self-deployment and fine-grained moderation. The assistant is currently in beta and lacks internet access. A tunable moderation mechanism is included to flag sensitive content.

7Mistral Ai News·15d ago·source ↗

Mistral AI Releases Mistral Small v24.09, Free API Tier, and Pixtral 12B Vision on le Chat with Broad Price Cuts

Mistral AI announced a multi-part release on September 17, 2024: a free tier for la Plateforme API, significant price reductions across its model family (up to 80% for Mistral Small and Codestral), an updated Mistral Small v24.09 (22B parameters, improved alignment and reasoning), and the availability of Pixtral 12B vision capabilities on le Chat. Pixtral 12B, released under Apache 2.0, supports images of any size without text performance degradation and is now accessible for free on le Chat. The pricing updates also apply to cloud partner deployments on Azure AI Studio, Amazon Bedrock, and Google Vertex AI.

6arXiv · cs.CL·7d ago·source ↗

The Shibboleth Effect: Cross-lingual behavioral skew in frontier LLMs under adversarial geopolitical simulation

Researchers introduce the 'Shibboleth Effect' — systematic behavioral differences in LLMs when operating in different languages — and audit six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, DeepSeek-R1) using a synthetic maritime territorial dispute wargame played in English versus Turkish. Results are heterogeneous: Llama-4 becomes significantly more coercive in Turkish while Gemini-3.1-Pro and DeepSeek-R1 become less so, and GPT-4o shows no detectable shift. The study identifies two candidate buffering mechanisms — chain-of-thought institutional anchoring and multilingual RLHF alignment — with direct implications for deploying LLMs in diplomatic or crisis-management contexts.

7Mistral Ai News·15d ago·source ↗

Mistral AI Releases Ministral 3B and 8B Edge Models

Mistral AI has introduced two new small language models, Ministral 3B and Ministral 8B, targeting on-device and edge computing use cases. Both models support up to 128k context length and claim state-of-the-art performance in the sub-10B parameter category, outperforming comparable models from Google and Meta on internal benchmarks. Ministral 8B features an interleaved sliding-window attention mechanism for memory-efficient inference and is priced at $0.1/M tokens via API, while Ministral 3B is priced at $0.04/M tokens. Weights for Ministral 8B Instruct are available for research use, with commercial licensing available on request.