Jannoshh / simple-sam
Sharpness-Aware Minimization for Efficiently Improving Generalization
☆41Updated 3 years ago
Alternatives and similar repositories for simple-sam:
Users that are interested in simple-sam are comparing it to the libraries listed below
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆56Updated 2 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.☆60Updated 3 years ago
- ☆73Updated 2 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆128Updated last year
- Implementation of self-supervised image-level contrastive pretraining methods using Keras.☆69Updated 3 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆140Updated 2 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆87Updated 3 years ago
- Keras implementation of Normalizer-Free Networks and SGD - Adaptive Gradient Clipping☆69Updated 3 years ago
- ☆57Updated 2 years ago
- Cyclemoid implementation for PyTorch☆87Updated 2 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆134Updated 3 years ago
- Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).☆126Updated 2 years ago
- Semantic Segmentation with Pytorch-Lightning☆63Updated 4 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆84Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆180Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆212Updated 3 years ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021.☆141Updated 3 years ago
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- ModelSoups for Tensorflow2 and Torch☆48Updated 2 years ago
- A hub for ResNet based models and pretrained weights in TensorFlow.☆21Updated 3 years ago
- An education step by step implementation of SimCLR that accompanies the blogpost☆32Updated 2 years ago
- Pytorch code for managing distributed training experiments.☆21Updated 4 years ago
- Code for the CVPR 2019 paper : Spectral Metric for Dataset Complexity Assessment☆45Updated 11 months ago
- ☆201Updated 2 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆60Updated 2 years ago
- Stochastic Weight Averaging Tutorials using pytorch.☆33Updated 4 years ago
- Implementation of Online Label Smoothing in PyTorch☆94Updated 2 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆127Updated last year
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space☆40Updated 4 years ago
- A PyTorch implementation of Sharpness-Aware Minimization for Efficiently Improving Generalization☆136Updated 3 years ago