jwang0306 / transformer-pytorch
A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).
☆2Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for transformer-pytorch
- ☆15Updated 4 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 4 years ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆19Updated 5 years ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.☆23Updated 2 years ago
- Code for "Understanding and Improving Layer Normalization"☆46Updated 4 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference☆79Updated 3 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58Updated 4 years ago
- DisCo Transformer for Non-autoregressive MT☆78Updated 2 years ago
- ☆95Updated 2 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆51Updated 2 years ago
- ☆32Updated 3 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- ☆14Updated 2 years ago
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆43Updated 10 months ago
- ☆23Updated 4 years ago
- Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"☆89Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆46Updated 2 years ago
- Implementation of RealFormer using pytorch☆102Updated 3 years ago
- Hard-Coded Gaussian Attention for Neural Machine Translation☆36Updated last year
- Open Vocabulary Learning for Neural Chinese Pinyin IME (ACL 2020)☆18Updated 5 years ago
- Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>☆41Updated 4 years ago
- ☆22Updated 3 years ago
- lanmt ebm☆11Updated 4 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆53Updated 3 years ago
- ☆51Updated 2 years ago
- ☆51Updated 4 years ago
- ☆10Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Updated 2 years ago
- A Translation Task using TurboTransformers☆11Updated 3 years ago