zeke-xie / stable-weight-decay-regularizationLinks

[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.

☆60

Alternatives and similar repositories for stable-weight-decay-regularization

Users that are interested in stable-weight-decay-regularization are comparing it to the libraries listed below

Sorting:

zeke-xie / Positive-Negative-Momentum
[ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.
☆28Updated 2 years ago
huangleiBuaa / NormalizationSurvey
This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application
☆85Updated 4 years ago
VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…
☆80Updated last year
haofanwang / awesome-mlp-papers
Recent Advances in MLP-based Models (MLP is all you need!)
☆116Updated 2 years ago
CownowAn / DaSS
Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)
☆24Updated last year
changliu00 / cygen
Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)
☆45Updated 3 years ago
juntang-zhuang / GSAM
PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)
☆144Updated 2 years ago
skhu101 / GM-NAS
Code for our ICLR'2022 paper "Generalizing Few-Shot NAS with Gradient Matching"
☆22Updated 2 years ago
KingJamesSong / DifferentiableSVD
A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…
☆78Updated last year
sanagno / neurips_2022_statistics
☆17Updated 2 years ago
rishikksh20 / ResMLP-pytorch
ResMLP: Feedforward networks for image classification with data-efficient training
☆43Updated 4 years ago
xiusu / ViTAS
Code for ViTAS_Vision Transformer Architecture Search
☆50Updated 4 years ago
zeke-xie / adaptive-inertia-adai
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…
☆150Updated 2 years ago
locuslab / deq-ddim
☆61Updated 2 years ago
bellymonster / Weighted-Soft-Label-Distillation
☆57Updated 4 years ago
thu-ml / implicit-normalizing-flows
Code for "Implicit Normalizing Flows" (ICLR 2021 spotlight)
☆36Updated 4 years ago
Delay-Xili / LDR
The official PyTorch implementation of the paper: Xili Dai, Shengbang Tong, et al. "Closed-Loop Data Transcription to an LDR via Minimaxi…
☆63Updated 2 years ago
zhenxingjian / Partial_Distance_Correlation
This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022
☆175Updated 2 years ago
szq0214 / SReT
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
☆65Updated 2 years ago
VITA-Group / AsViT
[ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…
☆76Updated 3 years ago
enyac-group / supmae
This is a offical PyTorch/GPU implementation of SupMAE.
☆78Updated 2 years ago
keivanalizadeh / ButterflyTransform
☆41Updated 4 years ago
dydjw9 / Efficient_SAM
☆58Updated 2 years ago
ziplab / EcoFormer
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
☆72Updated 2 years ago
LixiaoTHU / RCNN
an implementation of 'Recurrent Convolutional Neural Network for Object Recognition'
☆25Updated 5 years ago
quanlin-wu / dmae
Denoising Masked Autoencoders Help Robust Classification.
☆66Updated 2 years ago
okojoalg / raft-mlp
☆25Updated 3 years ago
fawazsammani / awesome-mlp-mixer
Transformers w/o Attention, based fully on MLPs
☆94Updated last year
VITA-Group / Sandwich-Batch-Normalization
[WACV 2022] "Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity" by Xinyu Gong, Wuyang Chen, Tian…
☆51Updated 3 years ago
zeke-xie / artificial-neural-variability-for-deep-learning
[Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur…
☆33Updated 4 years ago