sayakpaul / Adaptive-Gradient-ClippingLinks

Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.

☆85

Alternatives and similar repositories for Adaptive-Gradient-Clipping

Users that are interested in Adaptive-Gradient-Clipping are comparing it to the libraries listed below

Sorting:

sayakpaul / Training-BatchNorm-and-Only-BatchNorm
Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.
☆29Updated 4 years ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆32Updated 2 years ago
rasbt / cyclemoid-pytorch
Cyclemoid implementation for PyTorch
☆90Updated 3 years ago
ebartrum / lightning_gan_zoo
GAN models implemented with Pytorch Lightning and Hydra configuration
☆34Updated 3 years ago
Randl / kmeans_selfsuper
☆54Updated 3 years ago
digantamisra98 / EvoNorm
Unofficial PyTorch Implementation of EvoNorm
☆122Updated 3 years ago
ayulockin / DataAugmentationTF
Implementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.
☆34Updated 5 years ago
htoyryla / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆59Updated 4 years ago
sayakpaul / FunMatch-Distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
☆87Updated 3 years ago
sayakpaul / Sharpness-Aware-Minimization-TensorFlow
Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.
☆60Updated 3 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆19Updated 4 years ago
ariG23498 / G-SimCLR
This is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip…
☆78Updated 3 years ago
izmailovpavel / torch_swa_examples
☆47Updated 4 years ago
gcastex / PruNet
Pruning applied to Facial Recognition.
☆15Updated 6 years ago
sayakpaul / NALU
Neural Arithmetic Logic Units by Trask et al.
☆12Updated 6 years ago
rwightman / efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
☆128Updated last year
lnsmith54 / BOSS
This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…
☆36Updated 4 years ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆107Updated 4 years ago
lessw2020 / Thunder-Detr
(unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets
☆12Updated 4 years ago
FrancescoSaverioZuppichini / Loading-huge-PyTorch-models-with-linear-memory-consumption
Little article showing how to load pytorch's models with linear memory consumption
☆34Updated 2 years ago
MathInf / toroidal
a lightweight transformer library for PyTorch
☆72Updated 3 years ago
fabio-deep / ReZero-ResNet
Unofficial pytorch implementation of ReZero in ResNet
☆23Updated 5 years ago
sayakpaul / MLP-Mixer-CIFAR10
Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.
☆57Updated 3 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
lucidrains / remixer-pytorch
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Updated 3 years ago
anguelos / tormentor
Pytorch augmentation
☆119Updated last year
kartik4949 / deepops
a mini Deep Learning framework supporting GPU accelerations written with CUDA
☆32Updated 4 years ago
adam-mehdi / MuarAugment
State-of-the-art data augmentation search algorithms in PyTorch
☆47Updated last year
teddykoker / u-noise
Official PyTorch code for U-Noise: Learnable Noise Masks for Interpretable Image Segmentation (ICIP 2021)
☆41Updated 3 years ago
noahgolmant / pytorch-lr-dropout
"Learning Rate Dropout" in PyTorch
☆34Updated 5 years ago