Lookahead mechanism for optimizers in Keras.
☆50Jun 24, 2021Updated 4 years ago
Alternatives and similar repositories for keras-lookahead
Users that are interested in keras-lookahead are comparing it to the libraries listed below
Sorting:
- RAdam implemented in Keras & TensorFlow☆324Jan 22, 2022Updated 4 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Transformer-XL with checkpoint loader☆67Jan 22, 2022Updated 4 years ago
- lookahead optimizer for keras☆169Oct 14, 2019Updated 6 years ago
- Implementation of Rectified Adam in Keras☆70Aug 24, 2019Updated 6 years ago
- Keras implementation of NovoGrad☆20Aug 21, 2020Updated 5 years ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- Python scripts to facilitate easy working☆11Jun 24, 2024Updated last year
- Keras Label Smoothing for Supervised Learning☆11May 15, 2020Updated 5 years ago
- ☆13Jun 18, 2019Updated 6 years ago
- Implementation of XLNet that can load pretrained checkpoints☆169Jan 22, 2022Updated 4 years ago
- Octave convolution☆34Jan 22, 2022Updated 4 years ago
- Metric Learning TF 2.0+Keras Algorithm Implementations for Facial Recognition☆18Dec 8, 2022Updated 3 years ago
- RAdam optimizer for keras☆71Oct 14, 2019Updated 6 years ago
- ☆21Mar 30, 2018Updated 7 years ago
- A wrapper layer for stacking layers horizontally☆228Jan 22, 2022Updated 4 years ago
- Code for EMNLP 2019 paper "Modeling Multi-Action Policy for Task-Oriented Dialogues"☆19Sep 2, 2019Updated 6 years ago
- reimplement efficientnet use tf.keras tf2.0☆17Jun 6, 2019Updated 6 years ago
- How Time Matters: Learning Time-Decay Attention for Contextual Spoken Language Understanding in Dialogue☆21Apr 15, 2018Updated 7 years ago
- Load GPT-2 checkpoint and generate texts☆127Jan 22, 2022Updated 4 years ago
- Keras implementation of Cosine Annealing Scheduler☆43Apr 6, 2020Updated 5 years ago
- Exploring learning rates to improve model performance☆19Jun 6, 2019Updated 6 years ago
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- 2019达观杯信息提取第5名代码☆20Sep 20, 2019Updated 6 years ago
- A good example of deformable convolutional network for mnist classification☆21Oct 15, 2019Updated 6 years ago
- A TensorFlow Keras coding style for reducing boilerplate code in custom layers and models.☆18Jan 21, 2021Updated 5 years ago
- 2019达观杯实体识别☆19Sep 12, 2019Updated 6 years ago
- Play around with NGBoost and compare with LightGBM and XGBoost☆20Jun 17, 2024Updated last year
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Apr 1, 2020Updated 5 years ago
- Transformer implemented in Keras☆369Jan 22, 2022Updated 4 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers☆169Jan 6, 2022Updated 4 years ago
- 基于hrnet的backbone改进centernet☆23Aug 14, 2019Updated 6 years ago
- 记录每一个常用的深度模型结构的特点(图和代码)☆30Dec 17, 2018Updated 7 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- AdamW optimizer for Keras☆116Aug 9, 2019Updated 6 years ago
- DropBlock implemented in Keras☆26Jan 22, 2022Updated 4 years ago
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019☆27Mar 17, 2020Updated 5 years ago
- A library for minimizing the effects of confounding covariates☆15May 28, 2025Updated 9 months ago
- Train a CSRNet for estimating a crowd distribution☆10Jun 8, 2020Updated 5 years ago