xxgege / GAMLinks
The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".
☆83Updated 2 years ago
Alternatives and similar repositories for GAM
Users that are interested in GAM are comparing it to the libraries listed below
Sorting:
- The offical implement of ImbSAM (Imbalanced-SAM)☆24Updated last year
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)☆69Updated 7 months ago
- Official PyTorch implementation of PS-KD☆88Updated 2 years ago
- PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.☆66Updated 2 years ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆247Updated 2 years ago
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆97Updated 3 years ago
- [ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain☆75Updated 2 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆44Updated last year
- ResMLP: Feedforward networks for image classification with data-efficient training☆42Updated 4 years ago
- This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022☆173Updated 2 years ago
- Automated Search for Resource-Efficient Branched Multi-Task Networks [BMVC 2020]☆15Updated last year
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆33Updated 7 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆88Updated last year
- Transformers trained on Tiny ImageNet☆55Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆82Updated last year
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆146Updated 2 years ago
- ☆27Updated 2 years ago
- [NeurIPS 2022] A novel 1-Lipschitz network that can be efficiently trained to achieve certified L-infinity robustness for free!☆31Updated 2 years ago
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization☆75Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆65Updated 9 months ago
- ☆57Updated 3 years ago
- Official Implementation of Robust Training under Label Noise by Over-parameterization☆64Updated 2 years ago
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated last year
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆75Updated last year
- Code for ICML 2022 paper — Efficient Test-Time Model Adaptation without Forgetting☆125Updated 2 years ago
- [Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)☆150Updated 8 months ago
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners"☆42Updated last year
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆143Updated 2 years ago
- official source code for the Paper: **Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment** based on Pytorch.☆41Updated last month
- This resposity maintains a collection of important papers on knowledge distillation (awesome-knowledge-distillation)).☆78Updated 3 months ago