soskek / attention_is_all_you_need
Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer.
☆319Updated 7 years ago
Alternatives and similar repositories for attention_is_all_you_need:
Users that are interested in attention_is_all_you_need are comparing it to the libraries listed below
- Language Modeling☆156Updated 5 years ago
- ByteNet for character-level language modelling☆318Updated 7 years ago
- Batch normalized LSTM for tensorflow☆179Updated 8 years ago
- Recurrent Highway Networks - Implementations for Tensorflow, Torch7, Theano and Brainstorm☆404Updated 5 years ago
- attention model for entailment on SNLI corpus implemented in Tensorflow and Keras☆177Updated 8 years ago
- TensorFlow implementation of normalizations such as Layer Normalization, HyperNetworks.☆111Updated 8 years ago
- Dynamic Memory Networks (https://arxiv.org/abs/1603.01417) in Tensorflow☆240Updated 8 years ago
- ☆165Updated 8 years ago
- Tensorflow implementation of "Language Modeling with Gated Convolutional Networks"☆271Updated 8 years ago
- End-To-End Memory Network using Tensorflow☆342Updated 8 years ago
- TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".☆272Updated 7 years ago
- Code for Stanford CS224D: deep learning for natural language understanding☆224Updated 4 years ago
- QRNN implementation for TensorFlow☆236Updated last year
- Adaptive Computation Time algorithm in Tensorflow☆256Updated 7 years ago
- ☆167Updated 8 years ago
- A tensorflow implementation of Fairseq Convolutional Sequence to Sequence Learning(Gehring et al. 2017)☆305Updated 7 years ago
- ☆218Updated 9 years ago
- ☆143Updated 7 years ago
- in progress☆187Updated 7 years ago
- Tensorflow implementation for DilatedRNN☆347Updated 7 years ago
- Tensorflow implementation of Recursive Neural Networks using LSTM units☆136Updated 8 years ago
- Tutorial on "Practical Neural Networks for NLP: From Theory to Code" at EMNLP 2016☆434Updated 8 years ago
- Recurrent Conventinal NN Text Classification for chainer☆129Updated 7 years ago
- Code for Structured Attention Networks https://arxiv.org/abs/1702.00887☆237Updated 7 years ago
- A tensorflow implementation of "Generating Sentences from a Continuous Space"☆227Updated last year
- Gated Attention Reader for Text Comprehension☆188Updated 7 years ago
- Code and models from the paper "Layer Normalization"☆245Updated 8 years ago
- Mixed Incremental Cross-Entropy REINFORCE ICLR 2016☆332Updated 8 years ago
- Sequence to sequence learning using TensorFlow.☆390Updated 6 years ago
- Quasi-recurrent Neural Networks for Keras☆74Updated 7 years ago