phohenecker / pytorch-transformer
A PyTorch implementation of the Transformer model from "Attention Is All You Need".
☆59, updated 5 years ago
Alternatives and similar repositories for pytorch-transformer:
Users who are interested in pytorch-transformer are comparing it to the libraries listed below:
- Encoding position with the word embeddings. ☆82, updated 6 years ago
- A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB) ☆53, updated 2 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆61, updated 6 years ago
- Cascaded Text Generation with Markov Transformers ☆129, updated last year
- Sequence to Sequence Models in PyTorch ☆44, updated 5 months ago
- Code for "Language GANs Falling Short" ☆58, updated 3 years ago
- PyTorch Implementation of "Unsupervised Learning of Syntactic Structure with Invertible Neural Projections" (EMNLP 2018) ☆69, updated 4 years ago
- ☆120, updated 5 years ago
- Recurrent Variational Autoencoder with Dilated Convolutions that generates sequential data, implemented in PyTorch ☆72, updated 3 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only ☆50, updated 5 months ago
- Highway network implemented in PyTorch ☆82, updated 7 years ago
- Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders" ☆168, updated 6 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019) ☆63, updated 2 years ago
- ☆176, updated 4 years ago
- Two-Layer Hierarchical Softmax Implementation for PyTorch ☆69, updated 4 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference ☆79, updated 3 years ago
- PyTorch and Torchtext implementation of Sequence to sequence ☆60, updated 6 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018) ☆81, updated 3 years ago
- Checking the interpretability of attention on text classification models ☆47, updated 5 years ago
- ☆43, updated 6 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs" ☆77, updated 3 years ago
- PyTorch DataLoader for seq2seq ☆84, updated 5 years ago
- Non-autoregressive Neural Machine Translation (not a full version) ☆71, updated 2 years ago
- Code for EMNLP 2019 paper "Attention is not not Explanation" ☆57, updated 3 years ago
- ☆66, updated 2 years ago
- Code for "Strong Baselines for Neural Semi-supervised Learning under Domain Shift" (Ruder & Plank, 2018 ACL) ☆60, updated 2 years ago
- The Annotated Encoder Decoder with Attention ☆166, updated 3 years ago
- Adaptive Softmax implementation for PyTorch ☆79, updated 5 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39, updated 4 years ago