akurniawan / pytorch-transformer
Implementation of "Attention is All You Need" paper
☆32 Updated 3 months ago
Related projects
Alternatives and complementary repositories for pytorch-transformer
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch ☆61 Updated 5 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset ☆123 Updated 5 years ago
- ☆53 Updated 4 years ago
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding" ☆29 Updated 3 years ago
- ☆43 Updated 6 years ago
- Unsupervised neural machine translation; weight sharing; GAN ☆93 Updated 6 years ago
- Beam search for neural network sequence to sequence (encoder-decoder) models. ☆34 Updated 5 years ago
- A minimal NMT example to serve as a seq2seq+attention reference. ☆36 Updated 5 years ago
- 'Learning End-to-End Goal-Oriented Dialog with maximal User task success and minimal Human Agent use' - TACL, ACL 2019 oral presentation ☆10 Updated 4 years ago
- ☆47 Updated 5 years ago
- Ordered Neurons LSTM ☆30 Updated 2 years ago
- Sequence to Sequence Models in PyTorch ☆44 Updated 3 months ago
- PyTorch DataLoader for seq2seq ☆83 Updated 5 years ago
- Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering" ☆68 Updated 6 years ago
- ☆120 Updated 5 years ago
- Make Torchtext work with Keras. ☆18 Updated 5 years ago
- ☆47 Updated 4 years ago
- A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB) ☆54 Updated 2 years ago
- Transformer-XL with checkpoint loader ☆68 Updated 2 years ago
- The code for "An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation" (EMNLP 2018) ☆48 Updated 6 years ago
- Boolean Question Answering with multi-task learning, using large LM embeddings like BERT and RoBERTa ☆18 Updated 5 years ago
- Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing. ☆44 Updated 4 years ago
- TensorFlow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432) ☆82 Updated 2 years ago
- Multilingual hierarchical attention networks toolkit ☆78 Updated 4 years ago
- Keras encoder-decoder ☆17 Updated 6 years ago
- Implementation of Densely Connected Attention Propagation for Reading Comprehension (NIPS 2018) ☆70 Updated 5 years ago
- Non-autoregressive Neural Machine Translation (not a full version) ☆71 Updated last year
- Adaptive Softmax implementation for PyTorch ☆79 Updated 5 years ago