gunchagarg / learning-rate-techniques-keras
Exploring learning rates to improve model performance
☆19 Updated 6 years ago
Alternatives and similar repositories for learning-rate-techniques-keras
Users who are interested in learning-rate-techniques-keras are comparing it to the repositories listed below
- Implementation of Differential Learning Rate in Keras ☆11 Updated 6 years ago
- This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0 ☆12 Updated 5 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset. ☆13 Updated 4 years ago
- Implementation of Rectified Adam in Keras ☆70 Updated 6 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector. ☆38 Updated 2 years ago
- Attention-based sequence-to-sequence neural machine translation model built in Keras. ☆30 Updated 7 years ago
- Large Scale BERT Distillation ☆33 Updated 2 years ago
- Keras Wikipedia-based Language Model ☆21 Updated 7 years ago
- Demonstrates knowledge distillation for image-based models in Keras. ☆54 Updated 4 years ago
- Gradient accumulation for Keras ☆35 Updated 4 years ago
- Notebooks to accompany the blog posts about the 2nd place Kaggle RSNA winners: https://github.com/darraghdog/rsna ☆30 Updated 5 years ago
- RAdam optimizer for Keras ☆71 Updated 5 years ago
- Keras implementation of Global Context Attention blocks ☆46 Updated 6 years ago
- This repository shows how to implement a basic model for multimodal entailment. ☆10 Updated 4 years ago
- Implementing activation functions from scratch in TensorFlow. ☆36 Updated 3 years ago
- TensorFlow implementation of the Transformer network from the "Attention Is All You Need" paper, with example use cases. ☆16 Updated 5 years ago
- The code uses fine-tuning of BERT (Transformer neural network architecture) to accurately pick the correct answer among ten choices that be… ☆12 Updated 5 years ago
- TensorFlow NCE loss in Keras ☆34 Updated 6 years ago
- SNAIL Attention Block for Keras. ☆16 Updated 5 years ago
- BERT on Jigsaw Unintended Bias in Toxicity Classification ☆50 Updated 6 years ago
- Multi-class classification with focal loss for imbalanced datasets ☆82 Updated 5 years ago
- TensorFlow port of Single Headed Attention RNN ☆16 Updated 5 years ago
- RAdam + Lookahead implemented in TensorFlow ☆11 Updated 5 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test". ☆13 Updated 5 years ago
- Surface crack image classification using PyTorch Lightning ☆11 Updated 5 years ago
- Fine-tune multiple pre-trained Transformer-based models to solve the Vietnamese Fake News Detection problem (ReINTEL) in the VLSP2020 shared task ☆18 Updated 4 years ago
- Code and data for the paper "One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction" ☆61 Updated 5 years ago