soskek / attention_is_all_you_need
Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer.
☆323 · Oct 3, 2017 · Updated 8 years ago
Alternatives and similar repositories for attention_is_all_you_need
Users who are interested in attention_is_all_you_need are comparing it to the repositories listed below.
- fairseq: Convolutional Sequence to Sequence Learning (Gehring et al. 2017) by Chainer ☆67 · Jun 15, 2017 · Updated 8 years ago
- Chainer implementation of 'Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering' (https://arxiv.org/abs/1606.09… ☆68 · Dec 28, 2017 · Updated 8 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time" ☆65 · Jun 14, 2017 · Updated 8 years ago
- Recurrent Highway Networks - Implementations for Tensorflow, Torch7, Theano and Brainstorm ☆401 · Oct 9, 2019 · Updated 6 years ago
- A Chainer implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI ☆28 · Jun 20, 2018 · Updated 7 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf) ☆123 · Dec 10, 2017 · Updated 8 years ago
- ☆92 · May 23, 2017 · Updated 8 years ago
- Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" ☆223 · Nov 9, 2019 · Updated 6 years ago
- Deep Networks with Stochastic Depth implementation by Chainer ☆40 · Apr 11, 2016 · Updated 9 years ago
- ChainerMN: Scalable distributed deep learning with Chainer ☆206 · Apr 25, 2019 · Updated 6 years ago
- Slides/code for the Lisbon machine learning school 2017 ☆28 · Jul 27, 2017 · Updated 8 years ago
- ByteNet for character-level language modelling ☆318 · Aug 23, 2017 · Updated 8 years ago
- Question Dependent Recurrent Entity Network ☆13 · Sep 21, 2017 · Updated 8 years ago
- ☆833 · Jul 12, 2017 · Updated 8 years ago
- Run a static part of the computational graph written in Chainer with Tensorflow ☆20 · Jan 10, 2017 · Updated 9 years ago
- Now it is exported as an official example ☆13 · Jan 24, 2018 · Updated 8 years ago
- Dynamic Entity Summarization (DynES) ☆20 · May 10, 2019 · Updated 6 years ago
- Cleaned original source code from my NIPS publication ☆158 · Dec 4, 2017 · Updated 8 years ago
- Neural Networks Figures ☆52 · May 30, 2017 · Updated 8 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755) ☆2,114 · Jan 4, 2022 · Updated 4 years ago
- Poincaré Embedding (unofficial) ☆229 · May 7, 2019 · Updated 6 years ago
- Add-on package for ONNX format support in Chainer ☆86 · Nov 6, 2019 · Updated 6 years ago
- Chainer implementation of recent GAN variants ☆410 · Mar 24, 2023 · Updated 2 years ago
- A tutorial about neural machine translation including tips on building practical systems ☆369 · Nov 16, 2016 · Updated 9 years ago
- ☆143 · Jul 16, 2017 · Updated 8 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit ☆3,733 · Sep 17, 2021 · Updated 4 years ago
- ☆14 · Apr 12, 2017 · Updated 8 years ago
- A Chainer reimplementation of YOLOv2 (includes a Chainer loader for darknet and complete training code in Chainer) ☆339 · Sep 26, 2022 · Updated 3 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need ☆4,453 · May 21, 2023 · Updated 2 years ago
- ☆182 · Aug 17, 2018 · Updated 7 years ago
- Multi-Residual Networks ☆23 · Nov 25, 2016 · Updated 9 years ago
- Example usages of Chainer for natural language processing. ☆118 · Nov 30, 2016 · Updated 9 years ago
- Densely Connected Convolutional Network implementation by Chainer ☆39 · Jul 15, 2017 · Updated 8 years ago
- C++ code of "Tree-to-Sequence Attentional Neural Machine Translation (tree2seq ANMT)" ☆57 · Jun 23, 2017 · Updated 8 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom ☆278 · Feb 20, 2018 · Updated 7 years ago
- SegNet implementation & experiments in Chainer ☆42 · Jan 5, 2017 · Updated 9 years ago
- Deep Character-Level Neural Machine Translation ☆71 · Feb 17, 2017 · Updated 9 years ago
- A Chainer implementation of WGAN-GP. ☆12 · Oct 4, 2017 · Updated 8 years ago
- A Neural Network Toolkit. ☆177 · Dec 19, 2019 · Updated 6 years ago