rollovd / LookSAM

This is an unofficial repository for "Towards Efficient and Scalable Sharpness-Aware Minimization".

☆36

Alternatives and similar repositories for LookSAM

Users who are interested in LookSAM are comparing it to the libraries listed below.

MarlonBecker / MSAM
☆19 Updated last year
AngusDujw / SAF
☆36 Updated 2 years ago
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆32 Updated 8 months ago
dydjw9 / Efficient_SAM
☆58 Updated 2 years ago
yolky / RFAD
Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"
☆37 Updated 2 years ago
lzhangbv / eva
[ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation
☆12 Updated last year
mueller-mp / SAM-ON
☆34 Updated last year
juntang-zhuang / GSAM
PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)
☆143 Updated 2 years ago
stephane-rivaud / ForwardLocalGradient
This is the official implementation of the ICML 2023 paper "Can Forward Gradient Match Backpropagation?"
☆12 Updated 2 years ago
epfml / REQ
☆17 Updated last year
mil-ad / prospr
Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients
☆31 Updated 3 years ago
gortizji / linearized-networks
Source code of "What can linearized neural networks actually say about generalization?"
☆20 Updated 3 years ago
tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆35 Updated 3 years ago
themrzmaster / git-re-basin-pytorch
Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch
☆76 Updated 2 years ago
OPTML-Group / DeepZero
[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…
☆62 Updated 9 months ago
Mi-Peng / Sparse-Sharpness-Aware-Minimization
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
☆45 Updated 2 years ago
zhaoyang-0204 / gnp
gradient norm penalty
☆40 Updated last year
nblt / RWP
☆11 Updated 2 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43 Updated last year
ZichenMiao / CL_Atom_Swapping
ICLR 2022 (Spotlight): Continual Learning With Filter Atom Swapping
☆16 Updated 2 years ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆61 Updated 2 years ago
pomonam / jax-influence
A simple Jax implementation of influence functions.
☆16 Updated last year
pilancilab / Riemannian_Preconditioned_LoRA
Source code for the paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆27 Updated last year
KellerJordan / REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
☆48 Updated last year
tml-epfl / sam-low-rank-features
Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]
☆28 Updated last year
locuslab / orthogonal-convolutions
Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness
☆46 Updated 4 years ago
locuslab / edge-of-stability
☆70 Updated 7 months ago
VITA-Group / ToST
[ICML 2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, and Zhangyang Wang
☆28 Updated 2 years ago
boone891214 / MEST
[NeurIPS 2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…
☆18 Updated 3 years ago
vasusingla / simple-data-attribution
A simple and efficient baseline for data attribution
☆11 Updated last year