DSE-MSU / R-transformer

PyTorch implementation of R-Transformer. Some parts of the code are adapted from the implementations of TCN and Transformer.

☆230

Alternatives and similar repositories for R-transformer

Users who are interested in R-transformer are comparing it to the libraries listed below.

mttk / rnn-classifier
Minimal RNN classifier with self-attention in Pytorch
☆150 Updated 3 years ago
google-deepmind / lamb
LAnguage Modelling Benchmarks
☆138 Updated 5 years ago
keitakurita / Better_LSTM_PyTorch
An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.
☆134 Updated 5 years ago
andreamad8 / Universal-Transformer-Pytorch
Implementation of Universal Transformer in Pytorch
☆261 Updated 6 years ago
laiguokun / Funnel-Transformer
☆218 Updated 5 years ago
Stonesjtu / Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
☆319 Updated 5 years ago
locuslab / trellisnet
[ICLR'19] Trellis Networks for Sequence Modeling
☆471 Updated 5 years ago
XuezheMax / flowseq
Generative Flow based Sequence-to-Sequence Toolkit written in Python.
☆245 Updated 5 years ago
seba-1511 / lstms.pth
PyTorch implementations of LSTM Variants (Dropout + Layer Norm)
☆137 Updated 4 years ago
tatp22 / linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
☆417 Updated 3 years ago
Rick-McCoy / Reformer-pytorch
Implements Reformer: The Efficient Transformer in pytorch.
☆86 Updated 5 years ago
cerebroai / reformers
Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing
☆95 Updated 5 years ago
monk1337 / Various-Attention-mechanisms
This repository contains various types of attention mechanisms, like Bahdanau, Soft attention, Additive Attention, Hierarchical Attention…
☆125 Updated 3 years ago
FreedomIntelligence / complex-order
☆83 Updated 5 years ago
lancopku / Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
☆85 Updated 2 years ago
baaesh / DiSAN-pytorch
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
☆26 Updated 7 years ago
b-etienne / Seq2seq-PyTorch
☆76 Updated 5 years ago
gitabcworld / skiprnn_pytorch
A pytorch implementation of the paper: "Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks"
☆81 Updated 6 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆152 Updated 2 years ago
budzianowski / PyTorch-Beam-Search-Decoding
PyTorch implementation of beam search decoding for seq2seq models
☆337 Updated 2 years ago
guolinke / TUPE
Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…
☆251 Updated 3 years ago
kaushalshetty / Positional-Encoding
Encoding position with the word embeddings.
☆83 Updated 7 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85 Updated 6 years ago
andrewpeng02 / transformer-translation
Using PyTorch's nn.Transformer module to create an English-to-French neural machine translation model.
☆78 Updated 5 years ago
bastings / annotated_encoder_decoder
The Annotated Encoder Decoder with Attention
☆166 Updated 4 years ago
lingyongyan / Neural-Machine-Translation
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…
☆38 Updated 8 years ago
TensorUI / relative-position-pytorch
a pytorch implementation of self-attention with relative position representations
☆50 Updated 4 years ago
nadavbh12 / Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch
Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
☆61 Updated 6 years ago
AuCson / PyTorch-Batch-Attention-Seq2seq
PyTorch implementation of batched bi-RNN encoder and attention-decoder.
☆280 Updated 6 years ago
abduallahmohamed / Nested-LSTM-NLSTM-Pytorch
NLSTM Nested LSTM in Pytorch
☆17 Updated 7 years ago