juntang-zhuang / ACProp-OptimizerLinks

Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)

☆16

Alternatives and similar repositories for ACProp-Optimizer

Users that are interested in ACProp-Optimizer are comparing it to the libraries listed below

Sorting:

lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
lucidrains / remixer-pytorch
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Updated 4 years ago
RAIVNLab / LLC
☆14Updated 4 years ago
sayakpaul / AdaMatch-TF
Includes additional materials for the following keras.io blog post.
☆12Updated 4 years ago
facebookresearch / dmae_st
Directed masked autoencoders
☆14Updated 2 years ago
katelyn98 / CorruptionRobustness
We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th…
☆16Updated 4 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆19Updated 5 years ago
dlmacedo / distinction-maximization-loss
A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y…
☆44Updated 3 years ago
keivanalizadeh / ButterflyTransform
☆41Updated 4 years ago
shwinshaker / LipGrow
An adaptive training algorithm for residual network
☆17Updated 5 years ago
rehg-lab / CLRec
Pytorch implementation for "The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction"
☆33Updated 3 years ago
rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆21Updated 2 years ago
sayakpaul / Training-BatchNorm-and-Only-BatchNorm
Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.
☆29Updated 5 years ago
ChristophReich1996 / HyperMixer
PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].
☆18Updated 3 years ago
mfederici / dsit
Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"
☆25Updated 4 years ago
RobertCsordas / linear_layer_as_attention
The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …
☆16Updated 5 months ago
Kennethborup / self_distillation
Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression
☆19Updated 4 years ago
rahulvigneswaran / TailCalibX
Pytorch implementation of Feature Generation for Long-Tail Classification by Rahul Vigneswaran, Marc T Law, Vineeth N Balasubramaniam and…
☆38Updated 3 years ago
MadryLab / dataset-replication-analysis
☆25Updated 5 years ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆33Updated 3 years ago
lucidrains / compositional-attention-pytorch
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆51Updated 3 years ago
facebookresearch / grounding-inductive-biases
reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17Updated last year
MadryLab / ImageNetMultiLabel
Fine-grained ImageNet annotations
☆30Updated 5 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆28Updated last year
VITA-Group / Lifelong-Learning-LTH
[ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…
☆25Updated 3 years ago
VITA-Group / instant_soup
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Updated 2 years ago
lucidrains / deep-linear-network
A simple implementation of a deep linear Pytorch module
☆21Updated 5 years ago
lucidrains / cross-transformers-pytorch
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆54Updated 4 years ago
minhtannguyen / SRSGD
Code base for SRSGD.
☆28Updated 5 years ago