Rick-McCoy / Reformer-pytorchLinks

Implements Reformer: The Efficient Transformer in pytorch.

☆86

Alternatives and similar repositories for Reformer-pytorch

Users that are interested in Reformer-pytorch are comparing it to the libraries listed below

Sorting:

harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆129Updated 2 years ago
10-zin / Synthesizer
A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"
☆73Updated 2 years ago
kaushalshetty / Positional-Encoding
Encoding position with the word embeddings.
☆84Updated 7 years ago
laiguokun / Funnel-Transformer
☆219Updated 5 years ago
cloneofsimo / realformer-pytorch
Implementation of RealFormer using pytorch
☆101Updated 4 years ago
clovaai / length-adaptive-transformer
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102Updated 4 years ago
nng555 / ssmba
☆62Updated 3 years ago
layer6ai-labs / T-Fixup
Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"
☆89Updated 4 years ago
dreamgonfly / transformer-pytorch
A PyTorch implementation of Transformer in "Attention is All You Need"
☆106Updated 4 years ago
tnq177 / transformers_without_tears
Transformers without Tears: Improving the Normalization of Self-Attention
☆133Updated last year
clovaai / subword-qac
Subword Language Model for Query Auto-Completion
☆67Updated 6 years ago
lucidrains / charformer-pytorch
Implementation of the GBST block from the Charformer paper, in Pytorch
☆118Updated 4 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
DSE-MSU / R-transformer
Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.
☆230Updated 6 years ago
bloodwass / mixout
Implementation of Mixout with PyTorch
☆75Updated 2 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆151Updated 2 years ago
KrisKorrel / sparsemax-pytorch
Implementation of Sparsemax activation in Pytorch
☆164Updated 5 years ago
namisan / exdeep-nmt
☆32Updated 4 years ago
hunkim / ACL-2020-Papers
Statistics and Accepted paper list of ACL 2020 with arXiv link
☆23Updated 5 years ago
XuezheMax / flowseq
Generative Flow based Sequence-to-Sequence Toolkit written in Python.
☆246Updated 5 years ago
zomux / lanmt
LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference
☆79Updated 4 years ago
naver-ai / MetricMT
The official code repository for MetricMT - a reward optimization method for NMT with learned metrics
☆25Updated 4 years ago
lucidrains / memory-transformer-xl
A variant of Transformer-XL where the memory is updated not with a queue, but with attention
☆49Updated 5 years ago
heartcored98 / transformer_anatomy
Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020
☆16Updated 7 months ago
lucidrains / compressive-transformer-pytorch
Pytorch implementation of Compressive Transformers, from Deepmind
☆162Updated 4 years ago
yaohungt / TransformerDissection
[EMNLP'19] Summary for Transformer Understanding
☆53Updated 5 years ago
RayeRen / multilingual-kd-pytorch
ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation
☆70Updated 5 years ago
cerebroai / reformers
Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing
☆95Updated 5 years ago
XuezheMax / fairseq-apollo
FairSeq repo with Apollo optimizer
☆114Updated last year
lancopku / AdaNorm
Code for "Understanding and Improving Layer Normalization"
☆46Updated 5 years ago