divamgupta / attention-translation-keras
Attention-based sequence-to-sequence neural machine translation model built in Keras.
☆30Updated 7 years ago
Alternatives and similar repositories for attention-translation-keras
Users that are interested in attention-translation-keras are comparing it to the libraries listed below
- Implementation of Rectified Adam in Keras☆70Updated 6 years ago
- Learning rate multiplier☆46Updated 4 years ago
- SNAIL Attention Block for Keras.☆16Updated 5 years ago
- Tensorflow NCE loss in Keras☆34Updated 7 years ago
- AdamW optimizer for Keras☆115Updated 6 years ago
- Multi heads attention for image classification☆79Updated 7 years ago
- Multi-class classification with focal loss for imbalanced datasets☆82Updated 6 years ago
- ☆44Updated 7 years ago
- Lookahead mechanism for optimizers in Keras.☆50Updated 4 years ago
- Deep Neural Network Ensembles for Extreme Classification☆41Updated 6 years ago
- Keras implementation of NovoGrad☆20Updated 5 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- Jupyter Notebook presentation for class imbalance in binary classification☆48Updated 7 years ago
- Wrapper for Keras with support to easy data loading and handling and the creation of staged networks.☆28Updated 5 years ago
- A python script for a PyTorch feed forward neural network for tabular data using categorical embeddings.☆67Updated 5 years ago
- LSTM and Hierarchical Attention Network on DSVM☆41Updated 8 years ago
- Python scripts to facilitate easy working☆11Updated last year
- TF2.0 port for Augmix paper☆79Updated 5 years ago
- Keras implementation of SDE-Net (ICML 2020).☆15Updated 5 years ago
- Keras implementation of Nested LSTMs☆88Updated 6 years ago
- Keras implementation of NASNet-A☆88Updated 7 years ago
- Toy Keras implementation of a seq2seq model with examples.☆47Updated 5 years ago
- keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101)☆71Updated 7 years ago
- AdaBound optimizer in Keras☆56Updated 5 years ago
- ☆23Updated 6 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Updated 7 years ago
- Keras + Universal Sentence Encoder = Transfer Learning for text data☆33Updated 7 years ago
- Stats 479 Project☆22Updated 6 years ago
- Neural Machine Translation with Attention (PyTorch)☆44Updated 6 years ago
- Gradient accumulation for Keras☆35Updated 4 years ago