phohenecker / pytorch-transformer
A PyTorch implementation of the Transformer model from "Attention Is All You Need".
☆59 · Updated 5 years ago
Alternatives and similar repositories for pytorch-transformer:
Users who are interested in pytorch-transformer are comparing it to the libraries listed below.
- PyTorch DataLoader for seq2seq ☆84 · Updated 6 years ago
- The Annotated Encoder Decoder with Attention ☆166 · Updated 4 years ago
- Cascaded Text Generation with Markov Transformers ☆129 · Updated 2 years ago
- ☆64 · Updated 4 years ago
- Checking the interpretability of attention on text classification models ☆48 · Updated 5 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python. ☆244 · Updated 5 years ago
- ☆119 · Updated 6 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference ☆79 · Updated 3 years ago
- Recurrent Variational Autoencoder with Dilated Convolutions that generates sequential data, implemented in PyTorch ☆71 · Updated 3 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆61 · Updated 6 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer … ☆55 · Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 · Updated 4 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only ☆50 · Updated 8 months ago
- Highway network implemented in PyTorch ☆81 · Updated 7 years ago
- Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization" ☆88 · Updated 4 years ago
- Sequence to Sequence Models in PyTorch ☆44 · Updated 7 months ago
- Code for Multi-Head Attention: Collaborate Instead of Concatenate ☆152 · Updated last year
- Adaptive Softmax implementation for PyTorch ☆80 · Updated 5 years ago
- ☆175 · Updated 4 years ago
- A PyTorch implementation of: Language Modeling with Gated Convolutional Networks. ☆99 · Updated 3 years ago
- Code for "Language GANs Falling Short" ☆59 · Updated 3 years ago
- ☆216 · Updated 4 years ago
- Code for EMNLP 2019 paper "Attention is not not Explanation" ☆58 · Updated 3 years ago
- Implementation of Stochastic Beam Search using Fairseq ☆99 · Updated 5 years ago
- Non-autoregressive Neural Machine Translation (not a full version) ☆71 · Updated 2 years ago
- PyTorch and Torchtext implementation of Sequence to sequence ☆59 · Updated 7 years ago
- Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352) ☆125 · Updated 6 years ago
- ☆120 · Updated 5 years ago
- ☆88 · Updated 8 years ago
- NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation ☆69 · Updated last year