LOG-postech / Sassha
Official Pytorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"
β12Updated last month
Alternatives and similar repositories for Sassha
Users that are interested in Sassha are comparing it to the libraries listed below
Sorting:
- Code for UAI 2025 paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"β18Updated this week
- π¨ Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot maβ¦β17Updated 3 weeks ago
- β13Updated 2 months ago
- β27Updated 2 months ago
- β24Updated last year
- β17Updated 2 years ago
- β35Updated 2 years ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"β14Updated 3 years ago
- Implement of Dynamic Model Pruning with Feedback with pytorchβ40Updated 2 years ago
- Official PyTorch implementation of "Robust Deep Learning from Crowds with Belief Propagation"β19Updated 3 years ago
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorchβ16Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228β19Updated 3 years ago
- β34Updated last year
- β30Updated 3 years ago
- Code for testing DCT plus Sparse (DCTpS) networksβ14Updated 3 years ago
- [TMLR] CoDeC: Communication-Efficient Decentralized Continual Learningβ12Updated last year
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)β17Updated 6 years ago
- β11Updated last year
- β41Updated 3 years ago
- β67Updated 5 months ago
- Official Implementation of "The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers (ECCV 2024)ββ22Updated 4 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".β100Updated last year
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]β31Updated 3 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, Deβ¦β45Updated last year
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)β11Updated 4 years ago
- 2022_AAAI accepted paper, NaturalInversion:Data-Free Image Synthesis Improving Real-World Consistencyβ10Updated 3 years ago
- Official Implementation of LANTERN (ICLR'25) and LANTERN++(ICLRW-SCOPE'25)β13Updated 2 months ago
- Pytorch implementation of the paper "SNIP: Single-shot Network Pruning based on Connection Sensitivity" by Lee et al.β108Updated 6 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.β17Updated 2 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)β28Updated 2 years ago