nblt / F-SAM
[CVPR 2024] Friendly Sharpness-Aware Minimization
☆33 Updated 7 months ago
Alternatives and similar repositories for F-SAM
Users who are interested in F-SAM are comparing it to the libraries listed below.
- ☆35 Updated 2 years ago
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL) ☆44 Updated this week
- [TMLR 2024] Revisiting Random Weight Perturbation for Efficiently Improving Generalization ☆9 Updated 7 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆28 Updated last year
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization". ☆82 Updated last year
- ☆11 Updated 2 years ago
- ☆58 Updated 2 years ago
- ☆26 Updated last year
- ☆9 Updated last year
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin… ☆40 Updated 2 years ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021. ☆145 Updated 3 years ago
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries ☆30 Updated last year
- ☆30 Updated last month
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation" ☆37 Updated 2 years ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch) ☆32 Updated 2 years ago
- Variance Covariance Regularization ☆14 Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers" ☆105 Updated last year
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight) ☆68 Updated 7 months ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆79 Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆56 Updated 2 years ago
- ☆43 Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆64 Updated 8 months ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation" ☆36 Updated 10 months ago
- The official implementation of ImbSAM (Imbalanced-SAM) ☆23 Updated last year
- Transformers trained on Tiny ImageNet ☆54 Updated 2 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023) ☆40 Updated 2 years ago
- ☆30 Updated 3 years ago
- ☆65 Updated last year
- Gradient norm penalty ☆39 Updated 11 months ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023 ☆28 Updated last year