akurniawan / pytorch-transformerLinks

Implementation of "Attention is All You Need" paper

☆33

Alternatives and similar repositories for pytorch-transformer

Users that are interested in pytorch-transformer are comparing it to the libraries listed below

Sorting:

CyberZHG / keras-adaptive-softmax
Adaptive embedding and softmax
☆17Updated 3 years ago
CyberZHG / keras-ordered-neurons
Ordered Neurons LSTM
☆30Updated 3 years ago
santi-pdp / quasi-rnn
Quasi-RNN for language modeling
☆57Updated 8 years ago
benkrause / dynamiceval-transformer
☆47Updated 6 years ago
cerebroai / reformers
Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing
☆95Updated 5 years ago
nadavbh12 / Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch
Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
☆61Updated 6 years ago
CyberZHG / keras-transformer-xl
Transformer-XL with checkpoint loader
☆68Updated 3 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
ottokart / beam_search
Beam search for neural network sequence to sequence (encoder-decoder) models.
☆34Updated 6 years ago
DevSinghSachan / Attention_is_All_You_Need
☆42Updated 6 years ago
jojonki / Gated-Convolutional-Networks
A PyTorch implementation of : Language Modeling with Gated Convolutional Networks.
☆99Updated 3 years ago
kklemon / keras-loves-torchtext
Make Torchtext work with Keras.
☆18Updated 6 years ago
contentinnovation / NeurIPS-2018-papers
Machine-generated summaries and highlights of the every accepted paper at Thirty-second Conference on Neural Information Processing Syste…
☆71Updated 6 years ago
toru34 / rush_emnlp_2015
A Neural Attention Model for Abstractive Sentence Summarization in DyNet
☆19Updated 7 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85Updated 6 years ago
prajjwal1 / language-modelling
LM, ULMFit et al.
☆46Updated 5 years ago
vanzytay / WSDM2018_HyperQA
Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering"
☆68Updated 6 years ago
stefan-it / capsnet-nlp
CapsNet for NLP
☆67Updated 6 years ago
rosinality / adaptive-softmax-pytorch
Adaptive Softmax implementation for PyTorch
☆81Updated 6 years ago
rdspring1 / PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
☆123Updated 5 years ago
MultiPath / Squirrel
PyTorch implementation of Transformer-based Neural Machine Translation
☆78Updated 2 years ago
A-Jacobson / minimal-nmt
A minimal nmt example to serve as an seq2seq+attention reference.
☆36Updated 5 years ago
vanzytay / NIPS2018_DECAPROP
Implementation of Densely Connected Attention Propagation for Reading Comprehension (NIPS 2018)
☆69Updated 6 years ago
yangperasd / gated_cnn
Keras implementation of “Gated Linear Unit ”
☆23Updated last year
ParikhKadam / bidaf-keras
Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2
☆64Updated 2 years ago
sebastianGehrmann / diverse_ensembling
☆27Updated 6 years ago
lancopku / SMAE
This is the code for "Learning Sentiment Memories for Sentiment Modification without Parallel Data".
☆54Updated 6 years ago
shubhamagarwal92 / mmd
This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Co…
☆29Updated 5 years ago
kelayamatoz / BiDAF-PyTorch
An Implementation of Bidirectional Attention Flow
☆40Updated 7 years ago
bzhangGo / transformer-aan
souce code for "Accelerating Neural Transformer via an Average Attention Network"
☆78Updated 5 years ago