technique
ProbSparse Attention
techniqueactive
probsparse-attention-29da2dc3·1 events·first seen 28d agoAliases: ProbSparse Attention
Co-occurring entities
More like this (12)
Block Sparse Attentionsparse attentionMiniMax Sparse AttentionDeepSeek Sparse AttentionCross-Layer Sparse AttentionLocality-Sensitive Hashing AttentionDifferential AttentionCross-Layer Sparse Attention with Shared RoutingSparse AutoencoderSparse TransformerNeuronal Stochastic Attention Circuit (NSAC)reference attention
Recent events (1)
Multivariate Probabilistic Time Series Forecasting with Informer
A Hugging Face blog post introduces the Informer model for multivariate probabilistic time series forecasting. The post covers the architecture and usage of Informer, which uses a sparse attention mechanism (ProbSparse) to handle long sequences more efficiently than standard Transformers. It demonstrates how to use the model via the Hugging Face Transformers library for forecasting tasks.