PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)
☆147Aug 23, 2022Updated 3 years ago
Alternatives and similar repositories for GSAM
Users that are interested in GSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆57Feb 13, 2023Updated 3 years ago
- ☆35Dec 5, 2022Updated 3 years ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,975Feb 21, 2024Updated 2 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆48Jun 29, 2023Updated 2 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆85Jun 20, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆28Jan 13, 2023Updated 3 years ago
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization☆80May 31, 2023Updated 2 years ago
- This is unofficial repository for Towards Efficient and Scalable Sharpness-Aware Minimization.☆37Apr 15, 2024Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- ☆627Mar 25, 2026Updated last month
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Official implementation of the paper "From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client…☆12Mar 13, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a offical PyTorch/GPU implementation of SupMAE.☆80Aug 30, 2022Updated 3 years ago
- Official implementation of "Removing Batch Normalization Boosts Adversarial Training" (ICML'22)☆19Jul 20, 2022Updated 3 years ago
- ☆11Feb 27, 2023Updated 3 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- ☆34Jan 25, 2024Updated 2 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Dec 16, 2020Updated 5 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆44Sep 11, 2023Updated 2 years ago
- ☆14Aug 9, 2023Updated 2 years ago
- ☆10Jun 19, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Sep 1, 2022Updated 3 years ago
- The offical implement of ImbSAM (Imbalanced-SAM)☆27Mar 4, 2024Updated 2 years ago
- Pytorch (PyG) and Tensorflow (Keras/Spektral) implementation of Total Variation Graph Neural Network (TVGNN), as presented at ICML 2023.☆20Mar 15, 2025Updated last year
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆89Oct 2, 2021Updated 4 years ago
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆16Jun 28, 2023Updated 2 years ago
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆86Jun 16, 2021Updated 4 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆433Sep 5, 2023Updated 2 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…☆44Dec 30, 2021Updated 4 years ago
- Computing various measures and generalization bounds on convolutional and fully connected networks☆35Dec 13, 2018Updated 7 years ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆26Jul 21, 2023Updated 2 years ago
- [ICML 2024 spotlight] This repository contains the implementation details for the paper "Locally Estimated Global Perturbations are Bette…☆25Jul 29, 2024Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- The dataset for paper "Why Do We Click: Visual Impression-aware News Recommendation", ACM MM 2021☆15Feb 24, 2022Updated 4 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Apr 16, 2023Updated 3 years ago