jwang0306 / transformer-pytorch
A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).
☆2Updated 4 years ago
Related projects: ⓘ
- Code for "Understanding and Improving Layer Normalization"☆44Updated 4 years ago
- ☆15Updated 4 years ago
- DisCo Transformer for Non-autoregressive MT☆78Updated 2 years ago
- Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"☆89Updated 3 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference☆79Updated 3 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 3 years ago
- Implementation of Stochastic Beam Search using Fairseq☆96Updated 5 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆53Updated 3 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 4 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58Updated 4 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆22Updated last year
- Hard-Coded Gaussian Attention for Neural Machine Translation☆36Updated last year
- ☆20Updated 3 years ago
- Cascaded Text Generation with Markov Transformers☆126Updated last year
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Updated 3 years ago
- ☆95Updated 2 years ago
- PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…☆38Updated 7 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Updated 4 years ago
- ☆15Updated 2 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 3 years ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.