[UAI 2025] Official code for reproducing paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"
☆19May 14, 2025Updated 9 months ago
Alternatives and similar repositories for SAM-overparam
Users that are interested in SAM-overparam are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Official Pytorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"☆23Aug 11, 2025Updated 6 months ago
- ☆17Nov 10, 2025Updated 3 months ago
- ☆28Feb 21, 2025Updated last year
- ☆28Feb 19, 2024Updated 2 years ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"☆14May 17, 2022Updated 3 years ago
- tiny-imagenet dataset downloader & reader using tensorflow_datasets (tfds) api☆20Sep 17, 2019Updated 6 years ago
- Official PyTorch implementation of "A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning"☆28Oct 12, 2022Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- ☆10Oct 9, 2017Updated 8 years ago
- ☆13Sep 5, 2024Updated last year
- ☆10Sep 16, 2022Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Sep 11, 2023Updated 2 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Example code for the NNGeometry PyTorch library☆10Aug 20, 2025Updated 6 months ago
- ☆20Feb 3, 2025Updated last year
- Accelerating Transfer Learning with Robust Neural Nets☆11Oct 2, 2020Updated 5 years ago
- Note for quant research, for study☆11Mar 28, 2022Updated 3 years ago
- This repository is a collection of codes generated for optimizing the echo state network for RUL prediction of airplane engines☆10Oct 11, 2021Updated 4 years ago
- ☆25Sep 3, 2025Updated 5 months ago
- ☆12Sep 16, 2024Updated last year
- Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data☆13Sep 20, 2022Updated 3 years ago
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- ☆16Apr 26, 2023Updated 2 years ago
- Binary Classifier Calibration Models☆16Feb 27, 2017Updated 9 years ago
- ☆11Jul 11, 2024Updated last year
- A Mixture Density Layer for Keras☆10Nov 19, 2017Updated 8 years ago
- Tensorflow implementation for structured tabular data☆11Jan 21, 2023Updated 3 years ago
- ☆13Jun 23, 2022Updated 3 years ago
- 3D model classification☆14Jul 2, 2019Updated 6 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- 3D model voxelizer for deep learning applications (e.g. 3D CNN) in additive manufacutring (3D printing)☆11Nov 19, 2021Updated 4 years ago
- Active attention in classification networks that is optimised at the time of model training.☆11Nov 9, 2018Updated 7 years ago
- This repository shows how to efficiently process variable-length sequences in TensorFlow.☆14Apr 26, 2022Updated 3 years ago
- ☆13Aug 10, 2021Updated 4 years ago
- Comparison of gradient estimation techniques for black-box adversarial examples☆11Oct 31, 2018Updated 7 years ago
- ☆13Mar 22, 2023Updated 2 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 4 years ago
- ☆15Dec 7, 2021Updated 4 years ago