zhangxiangnick / Transformer-pyLinks
PyTorch implementation of machine translation model in "Attention is all you need"
☆17Updated 7 years ago
Alternatives and similar repositories for Transformer-py
Users that are interested in Transformer-py are comparing it to the libraries listed below
Sorting:
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆62Updated 6 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Updated 2 years ago
- Simple LSTM-based word-level language model in PyTorch☆47Updated 6 years ago
- souce code for "Accelerating Neural Transformer via an Average Attention Network"☆78Updated 6 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Updated 2 years ago
- ☆47Updated 6 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆123Updated 6 years ago
- Improving the Transformer translation model with document-level context☆170Updated 5 years ago
- CapsNet for NLP☆66Updated 6 years ago
- Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)☆50Updated 5 years ago
- Multiple Different Natural Language Processing Tasks in a Single Deep Model☆48Updated 6 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 3 years ago
- ☆119Updated 6 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 5 years ago
- Bi-Directional Block Self-Attention☆122Updated 7 years ago
- Knowledge Distillation For Transformer Language Models☆53Updated last year
- ☆93Updated 4 years ago
- a simple yet complete implementation of the popular BERT model☆128Updated 5 years ago
- ☆53Updated 5 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆159Updated 6 years ago
- PyTorch implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆59Updated 8 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆271Updated 3 years ago
- Compare-Aggregate method for WikiQA (via PyTorch)☆28Updated 7 years ago
- Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling☆147Updated 5 years ago
- Implementation of Densely Connected Attention Propagation for Reading Comprehension (NIPS 2018)☆69Updated 6 years ago
- My implementation of the FusionNet for machine comprehension☆27Updated 7 years ago
- ☆74Updated 7 years ago
- Beam search for neural network sequence to sequence (encoder-decoder) models.☆34Updated 6 years ago
- A PyTorch implementation of Attention is all you need☆42Updated 7 years ago
- A Pytorch implementation of QANet☆91Updated 7 years ago