nblt / F-SAM
[CVPR 2024] Friendly Sharpness-Aware Minimization
☆33 Updated 6 months ago
Alternatives and similar repositories for F-SAM:
Users who are interested in F-SAM are comparing it to the libraries listed below.
- ☆35 Updated 2 years ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 Updated last year
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries ☆29 Updated 11 months ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization". ☆82 Updated last year
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆28 Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆63 Updated 7 months ago
- The official implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar… ☆38 Updated 4 months ago
- ☆57 Updated 2 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections ☆20 Updated 6 months ago
- ☆30 Updated last week
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL) ☆44 Updated last month
- ☆63 Updated last year
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight) ☆67 Updated 6 months ago
- Variance Covariance Regularization ☆14 Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆79 Updated last year
- ☆45 Updated 2 years ago
- [ICML23] On Pitfalls of Test-Time Adaptation ☆115 Updated last year
- Transformers trained on Tiny ImageNet ☆54 Updated 2 years ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation" ☆37 Updated 2 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆44 Updated last year
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization ☆75 Updated last year
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021. ☆143 Updated 3 years ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib ☆16 Updated last month
- ☆11 Updated 2 years ago
- ☆43 Updated last year
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners" ☆42 Updated 11 months ago
- Denoising Masked Autoencoders Help Robust Classification. ☆62 Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆56 Updated 2 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models ☆23 Updated last month
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆83 Updated last year