idiap / fast-transformers
Pytorch library for fast transformer implementations
☆1,695 Updated 2 years ago
Alternatives and similar repositories for fast-transformers:
Users who are interested in fast-transformers are comparing it to the libraries listed below.
- An implementation of Performer, a linear attention-based transformer, in Pytorch ☆1,120 Updated 3 years ago
- Reformer, the efficient Transformer, in Pytorch ☆2,163 Updated last year
- Transformer based on a variant of attention that has linear complexity with respect to sequence length ☆755 Updated 11 months ago
- Long Range Arena for Benchmarking Efficient Transformers ☆750 Updated last year
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch ☆1,136 Updated last year
- My take on a practical implementation of Linformer for Pytorch. ☆413 Updated 2 years ago
- An All-MLP solution for Vision, from Google AI ☆1,016 Updated 7 months ago
- Longformer: The Long-Document Transformer ☆2,104 Updated 2 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,566 Updated 4 years ago
- Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch ☆665 Updated 4 months ago
- torch-optimizer -- collection of optimizers for Pytorch ☆3,101 Updated last year
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models ☆726 Updated last year
- Structured state space sequence models ☆2,606 Updated 9 months ago
- DeLighT: Very Deep and Light-Weight Transformers ☆467 Updated 4 years ago
- An implementation of local windowed attention for language modeling ☆436 Updated 3 months ago
- list of efficient attention modules ☆999 Updated 3 years ago
- A fast MoE impl for PyTorch ☆1,699 Updated 2 months ago
- Fully featured implementation of Routing Transformer ☆293 Updated 3 years ago
- PyTorch extensions for high performance and large scale training. ☆3,298 Updated last week
- ☆376 Updated last year
- [ICLR 2020] Lite Transformer with Long-Short Range Attention ☆607 Updated 9 months ago
- Implementation of Linformer for Pytorch ☆278 Updated last year
- Hopfield Networks is All You Need ☆1,787 Updated last year
- PyTorch implementation of some attentions for Deep Learning Researchers. ☆529 Updated 3 years ago
- PyTorch Extension Library of Optimized Scatter Operations ☆1,625 Updated this week
- Collection of common code that's shared among different research projects in FAIR computer vision team. ☆2,107 Updated 4 months ago
- Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch ☆1,806 Updated 9 months ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr… ☆1,611 Updated 3 years ago
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models ☆785 Updated 9 months ago
- Fast Block Sparse Matrices for Pytorch ☆545 Updated 4 years ago