andreamad8 / Universal-Transformer-Pytorch
Implementation of Universal Transformer in Pytorch
☆259Updated 6 years ago
Alternatives and similar repositories for Universal-Transformer-Pytorch:
Users that are interested in Universal-Transformer-Pytorch are comparing it to the libraries listed below
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆268Updated 2 years ago
- ☆396Updated 6 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆577Updated 5 years ago
- Sequence-to-Sequence learning using PyTorch☆523Updated 5 years ago
- Implementation of Dual Learning NMT on PyTorch☆164Updated 6 years ago
- Reinforcement Learning for Neural Machine Translation☆187Updated 2 weeks ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆244Updated 4 years ago
- ☆120Updated 5 years ago
- PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (…☆144Updated 5 years ago
- Transformer training code for sequential tasks☆610Updated 3 years ago
- Code for NIPS 2018 paper 'Frequency-Agnostic Word Representation'☆118Updated 5 years ago
- LAnguage Modelling Benchmarks☆137Updated 4 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆245Updated 6 years ago
- Neural Text Generation with Unlikelihood Training☆310Updated 3 years ago
- Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Re…☆309Updated 6 years ago
- ☆213Updated 4 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Updated last year
- PyTorch Implementation of "A Hierarchical Latent Structure for Variational Conversation Modeling" (NAACL 2018 Oral)☆173Updated 5 months ago
- Recurrent Variational Autoencoder that generates sequential data implemented with pytorch☆359Updated 7 years ago
- Implementation of the paper Tree Transformer☆212Updated 4 years ago
- Re-implement "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension"☆121Updated 6 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆243Updated 3 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆93Updated 4 years ago
- PyTorch implementation of batched bi-RNN encoder and attention-decoder.☆280Updated 6 years ago
- Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352)☆125Updated 6 years ago
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆226Updated 5 years ago
- Improving the Transformer translation model with document-level context☆172Updated 4 years ago
- Sparse and structured neural attention mechanisms☆223Updated 4 years ago
- Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)☆226Updated 3 years ago