elbayadm / attn2d
Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction
☆502 · Updated 4 years ago
Alternatives and similar repositories for attn2d
Users who are interested in attn2d are comparing it to the libraries listed below.
- Sequence-to-Sequence learning using PyTorch ☆521 · Updated 5 years ago
- ☆396 · Updated 6 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention ☆295 · Updated 6 years ago
- Latent Alignment and Variational Attention ☆327 · Updated 6 years ago
- [ICLR'19] Trellis Networks for Sequence Modeling ☆471 · Updated 6 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch. ☆245 · Updated 6 years ago
- PyTorch implementation of recurrent batch normalization ☆243 · Updated 6 years ago
- Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning ☆310 · Updated 5 years ago
- Tensorflow implementation for DilatedRNN ☆352 · Updated 7 years ago
- Sequence to Sequence Models with PyTorch ☆739 · Updated 3 years ago
- Sequence-to-Sequence Framework in PyTorch ☆391 · Updated 2 years ago
- Transformer training code for sequential tasks ☆612 · Updated 3 years ago
- The Annotated Encoder Decoder with Attention ☆166 · Updated 4 years ago
- Neural Machine Translation with Keras ☆529 · Updated 4 years ago
- Unsupervised Neural Machine Translation ☆474 · Updated 5 years ago
- Implementation of papers on Deep Seq2seq learning using Pytorch. ☆218 · Updated 6 years ago
- Code for the paper "Adversarially Regularized Autoencoders (ICML 2018)" by Zhao, Kim, Zhang, Rush and LeCun ☆398 · Updated 5 years ago
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM ☆1,263 · Updated 3 years ago
- Visualization for Sequential Neural Networks with Attention ☆458 · Updated 2 years ago
- Recurrent Variational Autoencoder that generates sequential data implemented with pytorch ☆359 · Updated 8 years ago
- Implements an efficient softmax approximation as described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/… ☆397 · Updated 6 years ago
- Nested LSTM Cell ☆251 · Updated 7 years ago
- ☆472 · Updated 3 years ago
- TensorFlow implementation of Independently Recurrent Neural Networks ☆513 · Updated 4 years ago
- Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer. ☆322 · Updated 7 years ago
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm) ☆137 · Updated 4 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)' ☆349 · Updated 7 years ago
- Adaptive Computation Time algorithm in Tensorflow ☆257 · Updated 8 years ago
- Sparse and structured neural attention mechanisms ☆224 · Updated 4 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 · Updated 6 years ago