akurniawan / pytorch-transformer
Implementation of the "Attention Is All You Need" paper
☆33 Updated 6 months ago
Alternatives and similar repositories for pytorch-transformer:
Users who are interested in pytorch-transformer are comparing it to the libraries listed below
- Transformer-XL with checkpoint loader ☆68 Updated 3 years ago
- Sequence to Sequence Models in PyTorch ☆44 Updated 6 months ago
- A minimal NMT example to serve as a seq2seq+attention reference. ☆36 Updated 5 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset ☆122 Updated 5 years ago
- Beam search for neural network sequence to sequence (encoder-decoder) models. ☆34 Updated 5 years ago
- CapsNet for NLP ☆67 Updated 6 years ago
- A PyTorch implementation of "Latent Variable Dialogue Models and their Diversity" ☆19 Updated 7 years ago
- DeepThought's solution ☆80 Updated last year
- Ordered Neurons LSTM ☆30 Updated 3 years ago
- PyTorch implementation of "Dynamic Coattention Networks For Question Answering" ☆62 Updated 6 years ago
- PyTorch DataLoader for seq2seq ☆84 Updated 5 years ago
- TensorFlow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432) ☆81 Updated 2 years ago
- ☆53 Updated 4 years ago
- TensorFlow implementation of HRED (Hierarchical Recurrent Encoder-decoder). ☆20 Updated 6 years ago
- ☆47 Updated 5 years ago
- LM, ULMFit et al. ☆46 Updated 5 years ago
- Comparing Text Classification results using BERT embedding and ULMFIT embedding ☆65 Updated 6 years ago
- Layer normalization implemented in Keras ☆60 Updated 3 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation" ☆74 Updated 4 years ago
- An Implementation of Bidirectional Attention Flow ☆40 Updated 7 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNLL 2018) ☆63 Updated 4 years ago
- fairseq: Convolutional Sequence to Sequence Learning (Gehring et al. 2017) by Chainer ☆64 Updated 7 years ago
- Multilingual hierarchical attention networks toolkit ☆77 Updated 5 years ago
- A PyTorch implementation of a Bi-LSTM CRF with character-level features ☆63 Updated 6 years ago
- This is the code for "Learning Sentiment Memories for Sentiment Modification without Parallel Data". ☆54 Updated 6 years ago
- A Neural Attention Model for Abstractive Sentence Summarization in DyNet ☆19 Updated 6 years ago
- Unsupervised neural machine translation; weight sharing; GAN ☆93 Updated 6 years ago
- Keras Wikipedia-based Language Model ☆22 Updated 6 years ago
- Encoding position with the word embeddings. ☆82 Updated 6 years ago
- BERT with customized features ☆25 Updated 5 years ago