Almanac
model

LLaVA OneVision 72B

modelactivellava-onevision-72b-94b9a7b6·1 events·first seen 1mo ago

Aliases: LLaVA OneVision 72B

Co-occurring entities

More like this (12)

Recent events (1)

5Mistral Ai News·1mo ago·source ↗

Pixtral 12B: Mistral AI's First Multimodal Model (Now Deprecated)

Mistral AI released Pixtral 12B in September 2024 as their first natively multimodal model, combining a new 400M parameter vision encoder trained from scratch with a 12B multimodal decoder based on Mistral Nemo. The model supports variable image sizes and aspect ratios, a 128K token context window for multiple images, and achieved 52.5% on MMMU while maintaining strong text-only benchmark performance. The model is now deprecated and has been replaced by newer vision and multimodal models from Mistral. It was released under Apache 2.0 license.