Daniil-Selikhanovych / Shampoo_optimizer

Our implementation of the Shampoo optimizer, based on https://arxiv.org/pdf/1802.09568.pdf

☆12

Alternatives and similar repositories for Shampoo_optimizer

Users who are interested in Shampoo_optimizer are comparing it to the repositories listed below.

sayakpaul / Training-BatchNorm-and-Only-BatchNorm
Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.
☆29 Updated 4 years ago
sayakpaul / NALU
Neural Arithmetic Logic Units by Trask et al.
☆12 Updated 6 years ago
juntang-zhuang / ACProp-Optimizer
Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)
☆16 Updated 3 years ago
lnsmith54 / BOSS
This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…
☆36 Updated 4 years ago
facebookresearch / grounding-inductive-biases
reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17 Updated 10 months ago
wolfecameron / optimizers
implements various optimizers from scratch for analysis and comparison
☆9 Updated 5 years ago
Holmeswww / PPOGAN
☆25 Updated last year
MadryLab / dataset-replication-analysis
☆25 Updated 5 years ago
sayakpaul / Adaptive-Gradient-Clipping
Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.
☆85 Updated 4 years ago
stanislavfort / adversaries_to_convnext
Adversarial examples to the new ConvNeXt architecture
☆20 Updated 3 years ago
ducha-aiki / hardnet-in-fastai2-and-kornia
Re-implementation of local descriptor HardNet training in fastai2+kornia
☆21 Updated 5 years ago
ayulockin / DataAugmentationTF
Implementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.
☆34 Updated 5 years ago
titu1994 / tf_neural_deconvolution
Neural Deconvolutions in Tensorflow
☆12 Updated 5 years ago
sayakpaul / AdaMatch-TF
Includes additional materials for the following keras.io blog post.
☆12 Updated 4 years ago
microsoft / pytorch_od
PyTorch ObjectDetection Modules and ONNX ops
☆18 Updated 2 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆19 Updated 4 years ago
yk / PyTorch_CIFAR10
Pretrained TorchVision models on CIFAR10 dataset (with weights)
☆24 Updated 4 years ago
lucidrains / deep-linear-network
A simple implementation of a deep linear Pytorch module
☆21 Updated 4 years ago
MadryLab / ImageNetMultiLabel
Fine-grained ImageNet annotations
☆29 Updated 5 years ago
shivram1987 / diffGrad
diffGrad: An Optimization Method for Convolutional Neural Networks
☆55 Updated 2 years ago
ariG23498 / G-SimCLR
This is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip…
☆78 Updated 3 years ago
google-research / noisy-fewshot-learning
☆23 Updated 4 years ago
oval-group / ali-g
Implementation of the ALI-G algorithm (PyTorch, Tensorflow)
☆22 Updated 4 years ago
lessw2020 / Thunder-Detr
(unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets
☆12 Updated 4 years ago
oguiza / DataAugmentation
☆12 Updated 3 years ago
Sharath-girish / LTH-ObjectRecognition
PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition
☆23 Updated 4 years ago
guoyongcs / NATv2
Implementation for NATv2.
☆23 Updated 4 years ago
minhtannguyen / SRSGD
Code base for SRSGD.
☆28 Updated 5 years ago
noahgolmant / pytorch-lr-dropout
"Learning Rate Dropout" in PyTorch
☆34 Updated 5 years ago
keivanalizadeh / ButterflyTransform
☆41 Updated 4 years ago