Entity · organization

Astera Institute

organizationactiveastera-institute-235ec9f1·1 events·first seen Jun 1, 2026

Aliases: Astera Institute

Co-occurring entities

University of California San Diego Mamba Stanford University Arnuv Tandon UC Berkeley Gated DeltaNet-2 Sliding Window Attention NVIDIA H100 Needle-in-a-Haystack Karan Dalal TTT-E2E The Pile

More like this (12)

ASTRA Arc Institute Anthropic Academy The Anthropic Institute AstrBot Anthropic Partner Academy Astral Scott Institute for Energy Innovation AISPA AlphaStar Alberta AI Academy AstrBotDevs

Recent events (1)

6The Batch·Jun 1, 2026·source ↗

Test-Time Training End-to-End (TTT-E2E) Retrains Model Weights to Handle Long Inputs

Researchers from Astera Institute, Nvidia, Stanford, UC Berkeley, and UC San Diego introduced TTT-E2E, a method that compresses long context into transformer weights by training the model during inference via meta-learning. The approach uses sliding-window attention restricted to 8,000 tokens and updates only the fully connected layers of the last quarter of the network on each 1,000-token chunk at inference time, keeping per-token generation latency roughly constant as context scales to 128,000 tokens. TTT-E2E slightly outperforms vanilla transformers on next-token prediction loss across long contexts and matches efficient architectures like Mamba 2 and Gated DeltaNet on inference speed, but fails dramatically on Needle-in-a-Haystack retrieval beyond 8,000 tokens and incurs substantially higher training latency. The work reframes long-context handling as a training-inference trade-off rather than an architectural design problem.

Training Infrastructure Long Context Evolution University of California San Diego Mamba Stanford University +13 more