sooftware / attentions
PyTorch implementations of several attention mechanisms for deep learning researchers.
☆529 Updated 3 years ago
Alternatives and similar repositories for attentions:
Users who are interested in attentions compare it to the libraries listed below:
- An implementation of Performer, a linear attention-based transformer, in Pytorch ☆1,119 Updated 3 years ago
- My take on a practical implementation of Linformer for Pytorch. ☆412 Updated 2 years ago
- Reformer, the efficient Transformer, in Pytorch ☆2,158 Updated last year
- Implementation of Linformer for Pytorch ☆276 Updated last year
- Pytorch library for fast transformer implementations ☆1,690 Updated 2 years ago
- Transformer based on a variant of attention that has linear complexity with respect to sequence length ☆751 Updated 10 months ago
- An implementation of local windowed attention for language modeling ☆431 Updated 2 months ago
- Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch ☆424 Updated 3 years ago
- Learning Rate Warmup in PyTorch ☆404 Updated 2 weeks ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning ☆609 Updated 2 years ago
- Long Range Arena for Benchmarking Efficient Transformers ☆748 Updated last year
- Contrastive Predictive Coding for Automatic Speaker Verification ☆491 Updated 5 years ago
- Early stopping for PyTorch ☆1,248 Updated 4 months ago
- Implement the paper "Self-Attention with Relative Position Representations" ☆128 Updated 4 years ago
- ☆450 Updated last year
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation" ☆554 Updated 4 years ago
- An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow ☆577 Updated 5 months ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch ☆224 Updated last year
- Pytorch Lightning code guideline for conferences ☆1,259 Updated last year
- Longformer: The Long-Document Transformer ☆2,096 Updated 2 years ago
- [ICML 2021, Long Talk] Delving into Deep Imbalanced Regression ☆860 Updated 3 years ago
- ☆64 Updated 4 years ago
- Tiny PyTorch library for maintaining a moving average of a collection of parameters. ☆426 Updated 5 months ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538 ☆1,083 Updated 11 months ago
- Deep Learning project template for PyTorch (multi-gpu training is supported) ☆136 Updated last year
- PyTorch implementation of the InfoNCE loss for self-supervised learning. ☆529 Updated last year
- Pytorch implementation of set transformer ☆571 Updated 5 years ago
- A learning rate range test implementation in PyTorch ☆950 Updated 3 months ago
- An All-MLP solution for Vision, from Google AI ☆1,016 Updated 6 months ago
- PyTorch implementation of some learning rate schedulers for deep learning researchers. ☆89 Updated 2 years ago