wyzjack / AdaM3Links

[ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization

☆26

Alternatives and similar repositories for AdaM3

Users that are interested in AdaM3 are comparing it to the libraries listed below

Sorting:

VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…
☆80Updated last year
yueb17 / PEMN
☆20Updated 2 years ago
sndnyang / Diffusion_ViT
PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"
☆49Updated 2 years ago
quanlin-wu / dmae
Denoising Masked Autoencoders Help Robust Classification.
☆66Updated 2 years ago
roymiles / VkD
[CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections
☆55Updated 9 months ago
zhangq327 / U-MAE
Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders
☆67Updated last year
Huage001 / DatasetFactorization
PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.
☆66Updated 2 years ago
JngwenYe / PNCloning
an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023
☆13Updated 2 years ago
yu-rp / Distribution-Shift-Iverson
☆42Updated last year
forever208 / EDM-ES
[ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'
☆27Updated last year
jiahaolu97 / anything-unsegmentable
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆28Updated last year
GuoQiushan / EGC
☆43Updated last year
AngusDujw / FTD-distillation
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
☆40Updated 2 years ago
JegZheng / MS-MLP
Pytorch implementation of Mix-Shifting-MLP (MS-MLP)
☆16Updated 3 years ago
xinliu20 / MEC
☆45Updated 2 years ago
hunto / DiffKD
Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
☆88Updated last year
MingSun-Tse / TPP
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
☆33Updated 2 years ago
enyac-group / supmae
This is a offical PyTorch/GPU implementation of SupMAE.
☆78Updated 2 years ago
rgeirhos / dataset-pruning-metrics
Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)
☆56Updated 2 years ago
MingSun-Tse / Good-DA-in-KD
[NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective
☆37Updated 2 years ago
zeke-xie / stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
☆60Updated last year
jiawangbai / HAT
Implementation of HAT https://arxiv.org/pdf/2204.00993
☆50Updated last year
changlin31 / AutoProg
(CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers
☆25Updated 5 months ago
lilijiangg / AutoDiffusion
☆45Updated last year
VILA-Lab / i-mae
i-mae Pytorch Repo
☆19Updated last year
yongchaoz / FRePo
Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)
☆47Updated 2 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40Updated 2 years ago
OliverRensu / SDMP
☆19Updated 2 years ago
ziyuwwang / DynaMixer
☆27Updated 3 years ago
zhenxingjian / Partial_Distance_Correlation
This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022
☆175Updated 2 years ago