Entity · technique

state space model

techniqueactivestate-space-model-ce3a98b7·5 events·first seen May 19, 2026

Aliases: state space model, State-Space Model (SSM), State Space Model (SSM), state-space models

Co-occurring entities

Mamba Hugging Face When Does Tool Use Increase the Expressive Power of Finite-Precision Recurrent Models?CaMBRAIN Self-Supervised Learning Electroencephalography (EEG)Transformers Key-Value Cache Sleep Consolidation Mechanism Multi-hop Graph Retrieval Cellular Automata Fast Weights Falcon Mamba Technology Innovation Institute Mamba2 Bamba

More like this (12)

An Exact Instrument for State Usage in Selective State-Space Models, and the Input-Driven Migration It Reveals physical world model ESM (Evolutionary Scale Modeling)species distribution modeling Energy-Based Models world model VPT Model latent dynamical systems stochastic-deterministic boundary (SDB)Sequential Monte Carlo (SMC)Diffusion Models Model Spec

Recent events (5)

6arXiv · cs.CL·Jul 8, 2026·source ↗

Formal theory of when tool use increases expressive power of finite-precision recurrent models

A new arXiv paper provides an exact, architecture-level characterization of when external tool access increases the computational expressivity of finite-precision recurrent sequence models, including SSMs. The key dichotomy: finite-state tools add essentially nothing (absorbable at logarithmic bit cost), while a single infinite-state tool (a read/write tape) makes the system Turing complete with only O(log|Q| + log|Γ|) bits of controller state. The paper further shows this construction is realized by a natural one-layer finite-precision selective affine SSM, with selectivity identified as essential.

Evaluation and Benchmarking Agent and Tool Ecosystem When Does Tool Use Increase the Expressive Power of Finite-Precision Recurrent Models?state space model

3arXiv · cs.AI·May 28, 2026·source ↗

CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models

CaMBRAIN is a Mamba-based causal state space model designed for real-time, continuous inference on variable-length EEG signals, addressing quadratic scaling limitations of attention-based models. It introduces a multi-stage self-supervised training pipeline for long-range memory retention and achieves state-of-the-art results across three EEG datasets with over 10x throughput improvement.

Long Context Evolution Mamba CaMBRAIN Self-Supervised Learning +2 more

6arXiv · cs.CL·May 26, 2026·source ↗

Language Models Need Sleep: Periodic Context Consolidation via Fast Weights and SSM Blocks

This paper proposes a sleep-like consolidation mechanism for transformer-based LLMs to address the quadratic scaling of attention with context length. During 'sleep' phases, the model performs N offline recurrent passes over accumulated context, updating fast weights in state-space model (SSM) blocks via a learned local rule, then clears the KV cache. The approach is evaluated on synthetic tasks (cellular automata, multi-hop graph retrieval) and math reasoning, where standard transformers and SSM-attention hybrids fail, with performance scaling with sleep duration N.

Long Context Evolution Frontier Model Releases Transformers Key-Value Cache Sleep Consolidation Mechanism +6 more

7Hugging Face Blog·May 19, 2026·source ↗

Falcon Mamba: First Strong Attention-Free 7B Model

Technology Innovation Institute (TII) releases Falcon Mamba, a 7B parameter state space model (SSM) based on the Mamba architecture, announced as the first attention-free model at this scale to match or exceed transformer-based models on standard benchmarks. The model is hosted on Hugging Face and represents a significant milestone for SSM-based architectures competing with transformers. This release advances the case for pure SSM models as viable alternatives to attention-based LLMs at the 7B scale.

Frontier Model Releases Open Weights Progress Mamba Falcon Mamba Hugging Face +3 more

5Hugging Face Blog·May 19, 2026·source ↗

Bamba: Inference-Efficient Hybrid Mamba2 Model

Hugging Face published a blog post introducing Bamba, a hybrid architecture combining Mamba2 state-space layers with attention layers, designed for inference efficiency. The model targets reduced KV-cache memory and improved throughput compared to pure transformer architectures. The post covers architecture details, training approach, and benchmarking results positioning Bamba as a practical alternative for deployment-constrained settings.

Training Infrastructure Frontier Model Releases Mamba2 Bamba Hugging Face +2 more