divamgupta / attention-translation-keras
Attention-based sequence-to-sequence neural machine translation model built in Keras.
☆30 Updated 6 years ago
Alternatives and similar repositories for attention-translation-keras:
Users who are interested in attention-translation-keras are comparing it to the libraries listed below.
- Implementation of Rectified Adam in Keras ☆69 Updated 5 years ago
- Toy Keras implementation of a seq2seq model with examples. ☆47 Updated 4 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utilities such as an embedding projector. ☆37 Updated 2 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆76 Updated 5 years ago
- Wrapper for Keras with support for easy data loading and handling and the creation of staged networks. ☆28 Updated 4 years ago
- ☆23 Updated 7 years ago
- Exploring learning rates to improve model performance ☆19 Updated 5 years ago
- Predict Toxic Comments in the wild ☆41 Updated 6 years ago
- 17th place solution in "Google Landmark Retrieval Challenge" ☆6 Updated 5 years ago
- SNAIL Attention Block for Keras. ☆16 Updated 4 years ago
- Layer normalization implemented in Keras ☆60 Updated 3 years ago
- Multi-GPU training using Keras with a TensorFlow backend. ☆20 Updated 7 years ago
- ☆43 Updated 6 years ago
- Keras implementation of Nested LSTMs ☆89 Updated 6 years ago
- AdamW optimizer for Keras ☆114 Updated 5 years ago
- LSTM and Hierarchical Attention Network on DSVM ☆41 Updated 7 years ago
- Keras implementation of NovoGrad ☆20 Updated 4 years ago
- Keras + Universal Sentence Encoder = Transfer Learning for text data ☆33 Updated 6 years ago
- Experiments with ELMo deep contextualized word representations in Keras with TensorFlow Hub. ☆13 Updated 6 years ago
- Google Smart Reply 2017 implementation in TensorFlow ☆23 Updated 2 years ago
- Implementation of IndRNN in Keras ☆67 Updated 4 years ago
- Sequence to Sequence and attention from scratch using TensorFlow ☆29 Updated 7 years ago
- LM, ULMFit et al. ☆46 Updated 5 years ago
- Neural Deconvolutions in TensorFlow ☆12 Updated 4 years ago
- Code for 3rd place in a Kaggle TensorFlow competition ☆96 Updated 6 years ago
- Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need" ☆28 Updated 5 years ago
- ☆37 Updated 7 years ago
- Lookahead mechanism for optimizers in Keras. ☆49 Updated 3 years ago
- Keras implementation of AdamW from "Fixing Weight Decay Regularization in Adam" (https://arxiv.org/abs/1711.05101) ☆70 Updated 6 years ago
- A Keras implementation of Adaptive Softmax ☆7 Updated 6 years ago