jwang0306 / transformer-pytorch

A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).

☆2

Related projects ⓘ

Alternatives and complementary repositories for transformer-pytorch

Edward-Sun / structured-nart
☆15Updated 4 years ago
m3yrin / nar-latent-alignment
Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437
☆24Updated 4 years ago
ictnlp / RSI-NAT
Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"
☆19Updated 5 years ago
ictnlp / Seq-NAT
Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.
☆23Updated 2 years ago
lancopku / AdaNorm
Code for "Understanding and Improving Layer Normalization"
☆46Updated 4 years ago
zomux / lanmt
LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference
☆79Updated 3 years ago
rosinality / imputer-pytorch
Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch
☆58Updated 4 years ago
facebookresearch / DisCo
DisCo Transformer for Non-autoregressive MT
☆78Updated 2 years ago
NAR-tutorial / acl2022
☆95Updated 2 years ago
fuzihaofzh / repetition-problem-nlg
Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.
☆51Updated 2 years ago
namisan / exdeep-nmt
☆32Updated 3 years ago
RayeRen / multilingual-kd-pytorch
ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation
☆70Updated 4 years ago
ustctf-zz / delibnet
☆14Updated 2 years ago
chenyangh / DSLP
Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation
☆43Updated 10 months ago
ictnlp / BoN-NAT
☆23Updated 4 years ago
layer6ai-labs / T-Fixup
Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"
☆89Updated 3 years ago
Glaciohound / Chimera-ST
A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆46Updated 2 years ago
cloneofsimo / realformer-pytorch
Implementation of RealFormer using pytorch
☆102Updated 3 years ago
fallcat / stupidNMT
Hard-Coded Gaussian Attention for Neural Machine Translation
☆36Updated last year
cooelf / OpenIME
Open Vocabulary Learning for Neural Chinese Pinyin IME (ACL 2020)
☆18Updated 5 years ago
ictnlp / OR-NMT
Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>
☆41Updated 4 years ago
iedwardwangi / MetaAdapter
☆22Updated 3 years ago
zomux / lanmt-ebm
lanmt ebm
☆11Updated 4 years ago
bojone / univae
基于Transformer的单模型、多尺度的VAE模型
☆53Updated 3 years ago
berniebear / Multi-HT100M
☆51Updated 2 years ago
ddkang / loss_dropper
☆51Updated 4 years ago
guolinke / fused_ops
☆10Updated 2 years ago
RUCAIBox / ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…
☆26Updated 2 years ago
ictnlp / TLAT-NMT
Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.
☆20Updated 2 years ago
TurboNLP / Translate-Demo
A Translation Task using TurboTransformers
☆11Updated 3 years ago