sayakpaul / Adaptive-Gradient-ClippingLinks
Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.
☆85Updated 4 years ago
Alternatives and similar repositories for Adaptive-Gradient-Clipping
Users that are interested in Adaptive-Gradient-Clipping are comparing it to the libraries listed below
Sorting:
- Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.☆29Updated 5 years ago
- This is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip…☆78Updated 3 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Implementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.☆34Updated 5 years ago
- This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…☆36Updated 5 years ago
- ☆47Updated 4 years ago
- (unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets☆12Updated 5 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.☆60Updated 3 years ago
- Unofficial PyTorch Implementation of EvoNorm☆122Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- Neural Arithmetic Logic Units by Trask et al.☆12Updated 6 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated 2 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆128Updated last year
- Pruning applied to Facial Recognition.☆15Updated 6 years ago
- ☆54Updated 3 years ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆57Updated 3 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆87Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 4 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Implementation of Feedback Transformer in Pytorch☆107Updated 4 years ago
- (partial) replication of results from https://arxiv.org/abs/1912.07768☆26Updated 5 years ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights)☆24Updated 5 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 3 years ago
- Unofficial pytorch implementation of ReZero in ResNet☆23Updated 5 years ago
- FasterAI: A repository for making smaller and faster models with the FastAI library.☆35Updated last year
- GAN models implemented with Pytorch Lightning and Hydra configuration☆34Updated 3 years ago
- a mini Deep Learning framework supporting GPU accelerations written with CUDA☆32Updated 4 years ago