Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
☆75Apr 5, 2019Updated 6 years ago
Alternatives and similar repositories for keras-LAMB-Optimizer
Users that are interested in keras-LAMB-Optimizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆121Jan 17, 2020Updated 6 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Sep 6, 2018Updated 7 years ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆377Dec 9, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Gold Loss Correction for training neural networks with labels corrupted with severe noise☆13Aug 17, 2019Updated 6 years ago
- Variational inference and disentangled representations through unsupervised learning☆21Mar 2, 2020Updated 6 years ago
- Load embeddings and featurize your sentences.☆31Oct 23, 2024Updated last year
- Keras implementation of Global Context Attention blocks☆46Apr 29, 2019Updated 6 years ago
- Fashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples.…☆12Aug 29, 2018Updated 7 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- ManifoldMixup with support for Interpolated Adversarial training☆17Mar 10, 2020Updated 6 years ago
- ☆15Apr 28, 2020Updated 5 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- This is the repository containing the MRI files and scripts used to analysed the data☆17Jul 13, 2020Updated 5 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- ☆26Jul 3, 2020Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Distributed semi-constrained microphone arrays☆31May 4, 2024Updated last year
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Keras implementation of Octave Convolutions☆52Apr 23, 2019Updated 6 years ago
- Bi-LSTM - CRF Named Entity Recognition model for Korean (Keras)☆16Feb 7, 2018Updated 8 years ago
- A plugin for repo2docker that outputs a directory with a shell-script and required files☆13Jun 24, 2025Updated 9 months ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Jan 14, 2023Updated 3 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- Improved Speech Enhancement GANs☆12Jun 24, 2020Updated 5 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- Sberbank Data Science Journey 2018 LightGBM Baseline☆20Oct 1, 2018Updated 7 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 对逻辑回归各种用法的总结,包括线性,多类,并行,分布式,在线,优化方案☆15Jul 14, 2017Updated 8 years ago
- ☆12Dec 22, 2020Updated 5 years ago
- Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding☆14Sep 28, 2017Updated 8 years ago
- Enterprise Solution for Text Classification (using BERT)☆10Dec 26, 2022Updated 3 years ago
- ☆12Apr 15, 2022Updated 3 years ago
- ☆10Dec 3, 2021Updated 4 years ago
- 패스트캠퍼스 파이토치(2018 1/27~) 실습 자료☆38May 27, 2018Updated 7 years ago