phohenecker / pytorch-transformerLinks

A PyTorch implementation of the Transformer model from "Attention Is All You Need".

☆59

Alternatives and similar repositories for pytorch-transformer

Users that are interested in pytorch-transformer are comparing it to the libraries listed below

Sorting:

bastings / annotated_encoder_decoder
The Annotated Encoder Decoder with Attention
☆166Updated 4 years ago
harvardnlp / TextFlow
☆121Updated 5 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85Updated 6 years ago
benkrause / dynamiceval-transformer
☆47Updated 6 years ago
harvardnlp / urnng
☆178Updated 5 years ago
zomux / lanmt
LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference
☆80Updated 3 years ago
ofirpress / YouMayNotNeedAttention
Code for the Eager Translation Model from the paper You May Not Need Attention
☆295Updated 6 years ago
rosinality / adaptive-softmax-pytorch
Adaptive Softmax implementation for PyTorch
☆81Updated 6 years ago
harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆129Updated 2 years ago
kaushalshetty / Positional-Encoding
Encoding position with the word embeddings.
☆83Updated 7 years ago
kefirski / pytorch_Highway
Highway network implemented in pytorch
☆80Updated 8 years ago
pclucas14 / GansFallingShort
Code for "Language GANs Falling Short"
☆59Updated 4 years ago
cybertronai / transformer-xl
Training Transformer-XL on 128 GPUs
☆140Updated 5 years ago
jiacheng-xu / vmf_vae_nlp
Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"
☆169Updated 6 years ago
google-deepmind / lamb
LAnguage Modelling Benchmarks
☆138Updated 5 years ago
laiguokun / Funnel-Transformer
☆218Updated 5 years ago
XuezheMax / flowseq
Generative Flow based Sequence-to-Sequence Toolkit written in Python.
☆245Updated 5 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
rdspring1 / PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
☆123Updated 5 years ago
harvardnlp / sa-vae
☆152Updated 7 years ago
salesforce / nonauto-nmt
PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"
☆271Updated 3 years ago
nyu-dl / dl4mt-nonauto
☆119Updated 6 years ago
keitakurita / Better_LSTM_PyTorch
An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.
☆134Updated 5 years ago
imatge-upc / danifojo-2018-repeatrnn
Comparing Fixed and Adaptive Computation Time for Recurrent Neural Networks
☆35Updated 7 years ago
agadetsky / pytorch-definitions
[ACL 2018] Conditional Generators of Words Definitions
☆33Updated 7 years ago
zomux / neuralcompressor
Embedding Quantization (Compress Word Embeddings)
☆86Updated 5 years ago
TimDettmers / transformer-xl
☆64Updated 5 years ago
Mjkim88 / pytorch-torchtext-seq2seq
Pytorch and Torchtext implementation of Sequence to sequence
☆59Updated 7 years ago
kefirski / contiguous-succotash
Recurrent Variational Autoencoder with Dilated Convolutions that generates sequential data implemented in pytorch
☆71Updated 4 years ago
vanzytay / QuaternionTransformers
Repository for ACL 2019 paper
☆72Updated 6 years ago