CyberZHG / keras-lookahead
Lookahead mechanism for optimizers in Keras.
☆50 · Jun 24, 2021 · Updated 4 years ago
Alternatives and similar repositories for keras-lookahead
Users who are interested in keras-lookahead are comparing it to the libraries listed below.
- RAdam implemented in Keras & TensorFlow ☆325 · Jan 22, 2022 · Updated 4 years ago
- Gradient accumulation for Keras ☆35 · Jun 27, 2021 · Updated 4 years ago
- Transformer-XL with checkpoint loader ☆68 · Jan 22, 2022 · Updated 4 years ago
- Lookahead optimizer for Keras ☆170 · Oct 14, 2019 · Updated 6 years ago
- Implementation of Rectified Adam in Keras ☆70 · Aug 24, 2019 · Updated 6 years ago
- Keras implementation of NovoGrad ☆20 · Aug 21, 2020 · Updated 5 years ago
- Python scripts to facilitate easy working ☆11 · Jun 24, 2024 · Updated last year
- Implementation of Differential Learning Rate in Keras ☆11 · Jun 4, 2019 · Updated 6 years ago
- Keras Label Smoothing for Supervised Learning ☆11 · May 15, 2020 · Updated 5 years ago
- Ordered Neurons LSTM ☆30 · Jan 22, 2022 · Updated 4 years ago
- ☆13 · Jun 18, 2019 · Updated 6 years ago
- Octave convolution ☆34 · Jan 22, 2022 · Updated 4 years ago
- ☆19 · Mar 30, 2018 · Updated 7 years ago
- Fast methods for non-negative matrix tri-factorization ☆16 · Oct 4, 2019 · Updated 6 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase ☆1,208 · Dec 22, 2023 · Updated 2 years ago
- A wrapper layer for stacking layers horizontally ☆228 · Jan 22, 2022 · Updated 4 years ago
- Reimplementation of EfficientNet using tf.keras (TF 2.0) ☆17 · Jun 6, 2019 · Updated 6 years ago
- Code for EMNLP 2019 paper "Modeling Multi-Action Policy for Task-Oriented Dialogues" ☆19 · Sep 2, 2019 · Updated 6 years ago
- Load GPT-2 checkpoint and generate texts ☆127 · Jan 22, 2022 · Updated 4 years ago
- Exploring learning rates to improve model performance ☆19 · Jun 6, 2019 · Updated 6 years ago
- Adaptive embedding and softmax ☆17 · Jan 22, 2022 · Updated 4 years ago
- A good example of deformable convolutional network for MNIST classification ☆21 · Oct 15, 2019 · Updated 6 years ago
- 2019 Daguan Cup named entity recognition ☆19 · Sep 12, 2019 · Updated 6 years ago
- Play around with NGBoost and compare with LightGBM and XGBoost ☆20 · Jun 17, 2024 · Updated last year
- Simple TensorFlow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond" ☆97 · Apr 1, 2020 · Updated 5 years ago
- Transformer implemented in Keras ☆369 · Jan 22, 2022 · Updated 4 years ago
- A collection of CVPR paper notes, covering abstract, motivation, architecture, results, and summary ☆27 · Dec 15, 2018 · Updated 7 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers ☆169 · Jan 6, 2022 · Updated 4 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆36 · Jun 6, 2024 · Updated last year
- Notes on the characteristics of each commonly used deep model architecture (with figures and code) ☆30 · Dec 17, 2018 · Updated 7 years ago
- AdamW optimizer for Keras ☆116 · Aug 9, 2019 · Updated 6 years ago
- DropBlock implemented in Keras ☆26 · Jan 22, 2022 · Updated 4 years ago
- A library for minimizing the effects of confounding covariates ☆15 · May 28, 2025 · Updated 8 months ago
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019 ☆27 · Mar 17, 2020 · Updated 5 years ago
- Train, save and serve a linear regression model in TensorFlow ☆32 · Sep 30, 2020 · Updated 5 years ago
- Convert tf.keras/Keras models to ONNX ☆382 · Sep 9, 2021 · Updated 4 years ago
- The unofficial TensorFlow implementation of the loss weights in "Gradient Harmonized Single-stage Detector", published at AAAI 2019 (Oral).… ☆29 · Feb 20, 2020 · Updated 5 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0 ☆204 · Mar 24, 2023 · Updated 2 years ago
- Keras Implementation of EfficientNets ☆185 · Mar 17, 2020 · Updated 5 years ago