Falcon Mamba

model·active·falcon-mamba-222421a8·1 event·first seen 28d ago

Aliases: Falcon Mamba

Recent events (1)

7·Hugging Face Blog·28d ago

Falcon Mamba: First Strong Attention-Free 7B Model

Technology Innovation Institute (TII) has released Falcon Mamba, a 7B-parameter state space model (SSM) based on the Mamba architecture. TII announced it as the first attention-free model at this scale to match or exceed transformer-based models on standard benchmarks. The model is hosted on Hugging Face, and its release strengthens the case for pure SSM models as viable alternatives to attention-based LLMs at the 7B scale.