model
Codestral Mamba
model · active · provisional
codestral-mamba-157a0a2b · 1 event · first seen 15d ago
Aliases: Codestral Mamba
Recent events (1)
Codestral Mamba: Mistral AI Releases Apache 2.0 Mamba-Architecture Code Model
Mistral AI has released Codestral Mamba, a 7.3B-parameter code-focused language model built on the Mamba state-space architecture rather than the Transformer architecture. Because a state-space model carries a fixed-size state from token to token, inference time grows linearly with sequence length, and the model can in principle handle sequences of unbounded length; Mistral reports testing in-context retrieval on inputs of up to 256k tokens. The model was developed with Mamba co-creators Albert Gu and Tri Dao, is released under the Apache 2.0 license, and is available through Hugging Face, the mistral-inference SDK, TensorRT-LLM, and Mistral's la Plateforme API. Mistral positions it as a local code assistant and says it performs on par with state-of-the-art Transformer-based code models.
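To make the linear-time claim concrete, here is a minimal sketch of a state-space recurrence in plain Python. It is a toy, not Mamba's selective scan and not Mistral's implementation: the parameters `a`, `b`, `c` are fixed rather than input-dependent, and the names `ssm_step` and `run_ssm` are hypothetical. It only illustrates that each token updates a fixed-size state once, so the work per token does not depend on how many tokens came before.

```python
from typing import List, Sequence, Tuple


def ssm_step(
    state: List[float],
    x: float,
    a: Sequence[float],
    b: Sequence[float],
    c: Sequence[float],
) -> Tuple[List[float], float]:
    """Advance a toy diagonal state-space recurrence by one token.

    h_t = a * h_{t-1} + b * x_t   (elementwise over the state)
    y_t = sum(c * h_t)
    """
    new_state = [a_i * h_i + b_i * x for a_i, h_i, b_i in zip(a, state, b)]
    y = sum(c_i * h_i for c_i, h_i in zip(c, new_state))
    return new_state, y


def run_ssm(
    xs: Sequence[float],
    a: Sequence[float],
    b: Sequence[float],
    c: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Process a whole sequence with one fixed-cost state update per token."""
    state = [0.0] * len(a)  # state size is set by the model, not the input
    ys: List[float] = []
    for x in xs:
        state, y = ssm_step(state, x, a, b, c)
        ys.append(y)
    return ys, state


# Illustrative (assumed) parameters for a two-dimensional state.
a = [0.5, 0.0]
b = [1.0, 1.0]
c = [1.0, 1.0]

outputs, final_state = run_ssm([1.0, 2.0], a, b, c)
long_outputs, long_state = run_ssm([1.0] * 10_000, a, b, c)
```

In this sketch the state stays at two numbers whether the input has 2 tokens or 10,000, whereas a Transformer's attention cache grows with every token it has seen.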