zhangxiangnick / Transformer-py
PyTorch implementation of machine translation model in "Attention is all you need"
☆17Updated 6 years ago
Alternatives and similar repositories for Transformer-py:
Users that are interested in Transformer-py are comparing it to the libraries listed below
- Simple LSTM-based word-level language model in PyTorch☆46Updated 5 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆61Updated 6 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Updated last year
- Sequence to Sequence Models in PyTorch☆44Updated 6 months ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆77Updated 2 years ago
- CapsNet for NLP☆67Updated 6 years ago
- Multiple Different Natural Language Processing Tasks in a Single Deep Model☆48Updated 6 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆122Updated 5 years ago
- PyTorch implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆60Updated 7 years ago
- ☆17Updated 2 years ago
- Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)☆50Updated 4 years ago
- DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding☆26Updated 6 years ago
- Source code of Knowledge Enhanced Hybrid Neural Network for Text Matching☆17Updated 6 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…☆32Updated 6 years ago
- ☆53Updated 4 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆65Updated 5 years ago
- ☆32Updated 3 years ago
- Implement modern LSTM cell by tensorflow and test them by language modeling task for PTB. Highway State Gating, Hypernets, Recurrent High…☆30Updated 6 years ago
- A novel method of constrained decoding for neural NLG (NNLG) models☆83Updated 4 years ago
- Semi Supervised Learning for Text-Classification☆83Updated 5 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 4 years ago
- Tensorflow implementation of HRED (Hierarchical Recurrent Encoder-decoder).☆20Updated 6 years ago
- ☆47Updated 5 years ago
- Code for NAACL19 Paper "How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection"☆42Updated 5 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 5 years ago
- Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)☆91Updated 6 years ago
- Knowledge Distillation For Transformer Language Models☆52Updated last year
- This repository is for the paper "Document-Level Multi-Aspect Sentiment Classification as Machine Comprehension"☆41Updated 7 years ago
- My implementation of the FusionNet for machine comprehension☆28Updated 6 years ago