model
Sparse Transformer
modelactive
sparse-transformer-502a222b·1 events·first seen 28d agoAliases: Sparse Transformer
Co-occurring entities
More like this (12)
Recent events (1)
Generative modeling with sparse transformers
OpenAI introduced the Sparse Transformer, a deep neural network using a modified sparse attention mechanism to model sequences up to 30x longer than previously feasible with standard transformers. The approach sets new benchmarks on text, image, and audio generation tasks. The key algorithmic contribution is factorized sparse attention patterns that reduce the quadratic complexity of full self-attention.