Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
☆75Apr 5, 2019Updated 6 years ago
Alternatives and similar repositories for keras-LAMB-Optimizer
Users who are interested in keras-LAMB-Optimizer are comparing it to the libraries listed below
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Sep 6, 2018Updated 7 years ago
- This is my code from the Google Cloud & YouTube-8M Video Understanding Challenge competition. My solution is based on video-level features only.☆16Jun 5, 2017Updated 8 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆121Jan 17, 2020Updated 6 years ago
- Here is my code from the Kaggle competition "Planet: Understanding the Amazon from Space"☆41Jul 24, 2017Updated 8 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- Keras implementation of Global Context Attention blocks☆46Apr 29, 2019Updated 6 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- Fashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples.…☆12Aug 29, 2018Updated 7 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- Developed Desktop Interface for Interacting with the IOT device (Neo SmartPen) FWP-F110. You can write any content on NCode Notebooks and…☆13Oct 1, 2018Updated 7 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 4 years ago
- ☆11Aug 8, 2018Updated 7 years ago
- 5th-place solution for Humpback whale identification☆48Mar 21, 2019Updated 6 years ago
- Naive Bayes classifier for language detection and spelling correction☆10Mar 2, 2020Updated 6 years ago
- A general, modular, and programmable architecture search framework☆124Mar 24, 2023Updated 2 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding☆14Sep 28, 2017Updated 8 years ago
- Keras implementation of Attention Augmented Convolutional Neural Networks☆120Mar 6, 2020Updated 6 years ago
- Intersection Over Union☆15Nov 26, 2017Updated 8 years ago
- Fast JavaScript implementation of t-SNE with tree-based acceleration☆15Jan 19, 2019Updated 7 years ago
- Gold Loss Correction for training neural networks with labels corrupted with severe noise☆13Aug 17, 2019Updated 6 years ago
- Spatial Decomposition and Transformation Network - TensorFlow☆14Dec 2, 2019Updated 6 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- ☆16Jan 24, 2018Updated 8 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Bi-LSTM-CRF Named Entity Recognition model for Korean (Keras)☆16Feb 7, 2018Updated 8 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Sberbank Data Science Journey 2018 LightGBM Baseline☆20Oct 1, 2018Updated 7 years ago
- Code for the "Avito Demand Prediction" Kaggle Challenge☆15Jul 1, 2018Updated 7 years ago
- Parallelized Cross Entropy Method☆14Jul 26, 2023Updated 2 years ago
- Learning to play Super Mario using the A3C algorithm☆12Sep 9, 2018Updated 7 years ago
- Improved Speech Enhancement GANs☆12Jun 24, 2020Updated 5 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- Solution to LSST-PLAsTiCC photometric transient classification challenge☆18Nov 7, 2018Updated 7 years ago
- An open-source implementation of sequence-to-sequence based speech processing engine☆39Jan 11, 2023Updated 3 years ago
- ☆19Jan 27, 2021Updated 5 years ago
- Convolutional Neural Network for Realtime Digit Recognition on Webcam☆48Aug 9, 2017Updated 8 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year