nblt / F-SAMLinks

[CVPR 2024] Friendly Sharpness-Aware Minimization

☆34

Alternatives and similar repositories for F-SAM

Users that are interested in F-SAM are comparing it to the libraries listed below

Sorting:

AngusDujw / SAF
☆35Updated 2 years ago
hananshafi / vits-for-small-scale-datasets
[BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"
☆165Updated 2 years ago
xxgege / GAM
The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".
☆84Updated 2 years ago
eric-ai-lab / PEViT
Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"
☆104Updated 2 years ago
dydjw9 / Efficient_SAM
☆58Updated 2 years ago
Jin-Ying / Multi-Level-Logit-Distillation
Code for 'Multi-level Logit Distillation' (CVPR2023)
☆70Updated last year
VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…
☆81Updated last year
Rose-STL-Lab / Teleportation-Optimization
[ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries
☆29Updated last year
Haochen-Wang409 / DropPos
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
☆62Updated last year
jiachenzhu / VCR
Variance Covariance Regularization
☆14Updated 2 years ago
OscarXZQ / weight-selection
☆186Updated last year
ehuynh1106 / TinyImageNet-Transformers
Transformers trained on Tiny ImageNet
☆58Updated 3 months ago
zju-SWJ / RLD
Official implementation for "Knowledge Distillation with Refined Logits".
☆20Updated last year
JinXins / Adversarial-AutoMixup
Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)
☆70Updated last year
Intellindust-AI-Lab / SURE
“SURE: SUrvey REcipes for building reliable and robust deep networks” (CVPR 2024) & (ECCV 2024 OOD-CV Challenge Winner)
☆71Updated 3 months ago
OPTML-Group / Robust-MoE-CNN
[ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…
☆66Updated 2 years ago
quanlin-wu / dmae
Denoising Masked Autoencoders Help Robust Classification.
☆67Updated 2 years ago
JeongsooP / RGB-no-more
An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers
☆56Updated 2 years ago
hunto / DiffKD
Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
☆92Updated last year
Mi-Peng / Sparse-Sharpness-Aware-Minimization
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
☆45Updated 2 years ago
zeke-xie / stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
☆61Updated last year
Westlake-AI / SEMA
Switch EMA: A Free Lunch for Better Flatness and Sharpness
☆28Updated last year
jiequancui / DKL
Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL)
☆48Updated 4 months ago
juntang-zhuang / GSAM
PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)
☆144Updated 3 years ago
jokofa / torch_kmeans
PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.
☆75Updated 2 years ago
JiamingLv / WKD
The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…
☆44Updated 11 months ago
yolky / RFAD
Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"
☆37Updated 2 years ago
zhangq327 / U-MAE
Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders
☆68Updated 2 years ago
lliai / DisWOT-CVPR2023
☆28Updated last year
TomerRonen34 / mixed-resolution-vit
☆54Updated 2 years ago