Almanac
model

Pixtral 12B

modelactivepixtral-12b-23213480·4 events·first seen 1mo ago

Aliases: Pixtral 12B, Pixtral-12B

Co-occurring entities

More like this (12)

Recent events (4)

5Mistral Ai News·1mo ago·source ↗

Pixtral 12B: Mistral AI's First Multimodal Model (Now Deprecated)

Mistral AI released Pixtral 12B in September 2024 as their first natively multimodal model, combining a new 400M parameter vision encoder trained from scratch with a 12B multimodal decoder based on Mistral Nemo. The model supports variable image sizes and aspect ratios, a 128K token context window for multiple images, and achieved 52.5% on MMMU while maintaining strong text-only benchmark performance. The model is now deprecated and has been replaced by newer vision and multimodal models from Mistral. It was released under Apache 2.0 license.

7Mistral Ai News·15d ago·source ↗

Mistral AI Releases Mistral Small v24.09, Free API Tier, and Pixtral 12B Vision on le Chat with Broad Price Cuts

Mistral AI announced a multi-part release on September 17, 2024: a free tier for la Plateforme API, significant price reductions across its model family (up to 80% for Mistral Small and Codestral), an updated Mistral Small v24.09 (22B parameters, improved alignment and reasoning), and the availability of Pixtral 12B vision capabilities on le Chat. Pixtral 12B, released under Apache 2.0, supports images of any size without text performance degradation and is now accessible for free on le Chat. The pricing updates also apply to cloud partner deployments on Azure AI Studio, Amazon Bedrock, and Google Vertex AI.

4Mistral Ai News·15d ago·source ↗

Mistral AI Demonstrates Pixtral-12B Fine-Tuning on Satellite Imagery via LoRA

Mistral AI published a technical case study showing how fine-tuning Pixtral-12B using LoRA on the Aerial Image Dataset (AID) significantly improves satellite image classification over the base model. The post details the fine-tuning workflow via Mistral's API and LaPlateforme UI, covering hyperparameter selection and structured output enforcement. Key improvements include better handling of ambiguous scene categories (e.g., Playground vs. Stadium) and reduced hallucination of invalid class labels. The article positions domain-specific fine-tuning as a practical bridge between general-purpose vision-language models and specialized geospatial applications.

7Mistral Ai News·15d ago·source ↗

Mistral AI Launches Major le Chat Update with Web Search, Canvas, Pixtral Large, and Image Generation

Mistral AI has announced a significant expansion of its le Chat assistant with several new capabilities in beta: web search with citations, a Canvas interface for collaborative document and code creation, multimodal document and image understanding powered by the new Pixtral Large model, and image generation via a partnership with Black Forest Labs (Flux Pro). The update also introduces shareable task agents for workflow automation and speculative editing for faster responses. All new features are currently offered on a free tier, positioning le Chat as a direct competitor to ChatGPT, Claude, and Perplexity.