sooftware / attentionsLinks
PyTorch implementation of some attentions for Deep Learning Researchers.
☆534Updated 3 years ago
Alternatives and similar repositories for attentions
Users that are interested in attentions are comparing it to the libraries listed below
Sorting:
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,136Updated 3 years ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆786Updated last year
- Pytorch library for fast transformer implementations☆1,724Updated 2 years ago
- Implementation of Transformer encoder in PyTorch☆66Updated 4 years ago
- My take on a practical implementation of Linformer for Pytorch.☆416Updated 2 years ago
- Transformer implementation in PyTorch.☆492Updated 6 years ago
- Implementation of Linformer for Pytorch☆292Updated last year
- Reformer, the efficient Transformer, in Pytorch☆2,174Updated 2 years ago
- An implementation of local windowed attention for language modeling☆459Updated 6 months ago
- Pytorch Lightning code guideline for conferences☆1,273Updated last year
- Learning Rate Warmup in PyTorch☆411Updated 3 weeks ago
- Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch☆427Updated 3 years ago
- An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow☆597Updated 8 months ago
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆322Updated last year
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆259Updated 4 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆759Updated last year
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆90Updated 2 years ago
- An All-MLP solution for Vision, from Google AI☆1,029Updated last week
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆609Updated 2 years ago
- Early stopping for PyTorch☆1,258Updated 8 months ago
- ☆462Updated 2 years ago
- pytorch; mask language model ; bert☆72Updated 5 years ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆1,033Updated 4 years ago
- Pytorch implementation of set transformer☆602Updated 5 years ago
- Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch☆708Updated last week
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆441Updated last year
- Transformers for Longer Sequences☆617Updated 2 years ago
- Implementation of the first paper on word2vec☆231Updated 3 years ago
- ☆64Updated 5 years ago
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI☆180Updated 2 years ago