divamgupta / attention-translation-keras
Attention based sequence to sequence neural machine translation model built in keras.
☆30Updated 6 years ago
Alternatives and similar repositories for attention-translation-keras:
Users that are interested in attention-translation-keras are comparing it to the libraries listed below
- Implementation of Rectified Adam in Keras☆69Updated 5 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆37Updated 2 years ago
- Exploring learning rates to improve model performance☆19Updated 5 years ago
- Learning rate multiplier☆46Updated 3 years ago
- code for 3rd place kaggle tensorflow competition☆96Updated 6 years ago
- 17th place solution in "Google Landmark Retrieval Challenge"☆6Updated 5 years ago
- ☆24Updated 5 years ago
- SNAIL Attention Block for Keras.☆16Updated 4 years ago
- RAdam optimizer for keras☆71Updated 5 years ago
- Keras + Universal Sentence Encoder = Transfer Learning for text data☆33Updated 6 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 5 years ago
- Lookahead mechanism for optimizers in Keras.☆49Updated 3 years ago
- bert on Jigsaw Unintended Bias in Toxicity Classification☆50Updated 5 years ago
- ☆43Updated 6 years ago
- ☆23Updated 7 years ago
- Multi heads attention for image classification☆81Updated 6 years ago
- AdaBound optimizer in Keras☆56Updated 4 years ago
- Position embedding layers in Keras☆58Updated 3 years ago
- ☆19Updated 5 years ago
- LM, ULMFit et al.☆46Updated 5 years ago
- ☆8Updated 5 years ago
- Pytorch implementation of Dauphin et al. (2016) "Language Modeling with Gated Convolutional Networks"☆29Updated 2 years ago
- Tensorflow Implementation of Densely Connected Bidirectional LSTM with Applications to Sentence Classification☆47Updated 6 years ago
- Kaggle Competition notebooks☆33Updated 5 years ago
- Keras implementation of NovoGrad☆20Updated 4 years ago
- Toy Keras implementation of a seq2seq model with examples.☆47Updated 4 years ago
- Layer normalization implemented in Keras☆60Updated 3 years ago
- Multi-GPU training using Keras with a Tensorflow backend.☆20Updated 7 years ago
- Wrapper for Keras with support to easy data loading and handling and the creation of staged networks.☆28Updated 4 years ago