LOG-postech / SasshaLinks
Official Pytorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"
β13Updated 2 months ago
Alternatives and similar repositories for Sassha
Users that are interested in Sassha are comparing it to the libraries listed below
Sorting:
- Code for UAI 2025 paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"β18Updated 3 weeks ago
- π¨ Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot maβ¦β17Updated last month
- β13Updated 2 months ago
- β27Updated 3 months ago
- β24Updated last year
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"β14Updated 3 years ago
- Official PyTorch implementation of "Robust Deep Learning from Crowds with Belief Propagation"β19Updated 3 years ago
- β11Updated last year
- Meta-Learned Self-Supervised Detectionβ20Updated 3 years ago
- β17Updated 2 years ago
- Official PyTorch implementation of "A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning"β28Updated 2 years ago
- β41Updated 3 years ago
- Official Implementation of "The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers (ECCV 2024)ββ22Updated 4 months ago
- Official PyTorch implementation of "Hyperbolic VAE via Latent Gaussian Distributions"β20Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228β19Updated 3 years ago
- β35Updated 2 years ago
- β11Updated 2 years ago
- β15Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".β102Updated 2 years ago
- Official implementation of "Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning"β14Updated 7 months ago
- Implement of Dynamic Model Pruning with Feedback with pytorchβ40Updated 2 years ago
- Official implementation of 'Anonymization for Skeleton Action Recognition'β29Updated 2 years ago
- [ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutionsβ27Updated 3 months ago
- β12Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxβ¦β12Updated 2 years ago
- β34Updated last year
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorchβ16Updated last year
- Code for testing DCT plus Sparse (DCTpS) networksβ14Updated 3 years ago
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]β32Updated 3 years ago
- [NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gaoβ¦β70Updated last year