xl402 / performer
Tensorflow implementation of a linear attention architecture
☆44Updated 4 years ago
Alternatives and similar repositories for performer
Users who are interested in performer are comparing it to the libraries listed below.
- Implementation of Feedback Transformer in Pytorch☆106Updated 4 years ago
- Simple stochastic weight averaging callback for Keras☆63Updated 3 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆95Updated 5 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆135Updated last month
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆56Updated 2 years ago
- Unofficial PyTorch implementation of Fastformer based on the paper "Fastformer: Additive Attention Can Be All You Need"☆134Updated 3 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆71Updated 2 years ago
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.☆74Updated 2 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning☆161Updated last year
- Axial Positional Embedding for Pytorch☆79Updated 2 months ago
- Layerwise Batch Entropy Regularization☆23Updated 2 years ago
- Implementation of ETSformer, state of the art time-series Transformer, in Pytorch☆153Updated last year
- Simple NumPy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/☆37Updated 4 years ago
- NLP Examples using the 🤗 libraries☆41Updated 4 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021)☆60Updated 3 years ago
- Relative Positional Encoding for Transformers with Linear Complexity☆63Updated 3 years ago
- Fourth place solution to the "OpenVaccine: COVID-19 mRNA Vaccine Degradation Prediction" organized by Stanford University and Kaggle☆20Updated 4 years ago
- Transformers without Tears: Improving the Normalization of Self-Attention☆131Updated 11 months ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated 9 months ago
- Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms."☆22Updated 3 years ago
- Various transformers for FSDP research☆37Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆78Updated 4 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms