LOG-postech / Sassha

Official PyTorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"

☆14

Alternatives and similar repositories for Sassha

Users who are interested in Sassha compare it to the repositories listed below.

LOG-postech / SAM-overparam
Code for UAI 2025 paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"
☆19 Updated last month
edong6768 / Malet
🔨 Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot ma…
☆17 Updated 2 months ago
LOG-postech / ZIP
☆13 Updated 3 months ago
LOG-postech / rethinking-LLM-pruning
☆27 Updated 4 months ago
ssbin4 / Closer-Intervention-CBM
☆24 Updated last year
devansh20la / LPF-SGD
☆17 Updated 2 years ago
ml-postech / multi-armed-bandit-algorithm-against-strategic-replication
Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"
☆14 Updated 3 years ago
ml-postech / robust-deep-learning-from-crowds-with-belief-propagation
Official PyTorch implementation of "Robust Deep Learning from Crowds with Belief Propagation"
☆19 Updated 3 years ago
nsfzyzz / loss_landscape_taxonomy
[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆19 Updated 3 years ago
effl-lab / MaskedKD
Official implementation of "The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers" (ECCV 2024)
☆23 Updated 5 months ago
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆102 Updated 2 years ago
mueller-mp / SAM-ON
☆34 Updated last year
ml-postech / gradient-inversion-generative-image-prior
☆42 Updated 3 years ago
INCHEON-CHO / Dynamic_Model_Pruning_with_Feedback
Implementation of Dynamic Model Pruning with Feedback in PyTorch
☆40 Updated 3 years ago
ml-postech / MetaSSD
Meta-Learned Self-Supervised Detection
☆20 Updated 3 years ago
nsfzyzz / Generalization_metrics_for_NLP
[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…
☆12 Updated 2 years ago
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆13 Updated 11 months ago
sangamesh-kodge / class_forgetting
[Deep Unlearning-PyTorch] Class Forgetting as in paper "Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetti…
☆15 Updated 11 months ago
nblt / TWA
[ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions
☆27 Updated 4 months ago
ml-postech / SSAD
☆11 Updated last year
ksachdeva / tiny-imagenet-tfds
tiny-imagenet dataset downloader & reader using tensorflow_datasets (tfds) api
☆20 Updated 5 years ago
ml-postech / RoWN
Official PyTorch implementation of "A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning"
☆28 Updated 2 years ago
AngusDujw / SAF
☆36 Updated 2 years ago
hzf1174 / RoBoT
Official implementation of "Robustifying and Boosting Training-Free Neural Architecture Search"
☆11 Updated last year
iamkanghyunchoi / qimera
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]
☆33 Updated 3 years ago
neuralcollapse / neuralcollapse
Code reproducing the Neural Collapse phenomenon with MSE and cross-entropy losses
☆14 Updated 3 years ago
ZhengaoLi / DISP-LLM-Dimension-Independent-Structural-Pruning
An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.
☆20 Updated 2 months ago
mccrearyd / rigl-torch
Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.
☆57 Updated 3 years ago
Sakshi09Ch / CoDeC
[TMLR] CoDeC: Communication-Efficient Decentralized Continual Learning
☆12 Updated last year
ml-postech / HUB
Official implementation of "Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning"
☆15 Updated 7 months ago