Jannoshh / simple-sam
Sharpness-Aware Minimization for Efficiently Improving Generalization
☆41 Updated 3 years ago
Alternatives and similar repositories for simple-sam
Users who are interested in simple-sam are comparing it to the libraries listed below.
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset. ☆56 Updated 3 years ago
- ☆74 Updated 2 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2. ☆60 Updated 3 years ago
- Implementation of self-supervised image-level contrastive pretraining methods using Keras. ☆70 Updated 3 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning ☆162 Updated last year
- Implementation of Nyström Self-attention, from the paper Nyströmformer ☆136 Updated 3 months ago
- A simple to use pytorch wrapper for contrastive self-supervised learning on any neural network ☆141 Updated 4 years ago
- Keras implementation of Normalizer-Free Networks and SGD - Adaptive Gradient Clipping ☆70 Updated 4 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2. ☆85 Updated 4 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆81 Updated 3 years ago
- Cyclemoid implementation for PyTorch ☆90 Updated 3 years ago
- Implementation of Fast Transformer in Pytorch ☆175 Updated 3 years ago
- A GPT, made only of MLPs, in Jax ☆58 Updated 4 years ago
- A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners ☆80 Updated 3 years ago
- Implementation of Feedback Transformer in Pytorch ☆107 Updated 4 years ago
- Exploiting Explainable Metrics for Augmented SGD [CVPR2022] ☆45 Updated 3 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi … ☆65 Updated 4 years ago
- Easy-to-use AdaHessian optimizer (PyTorch) ☆79 Updated 4 years ago
- Axial Positional Embedding for Pytorch ☆83 Updated 4 months ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆180 Updated 4 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr… ☆22 Updated 3 years ago
- Minimal implementation of SimSiam (https://arxiv.org/abs/2011.10566) in TensorFlow 2. ☆98 Updated 4 years ago
- Memory Efficient Attention (O(sqrt(n))) for Jax and PyTorch ☆184 Updated 2 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable ☆215 Updated 4 years ago
- ☆14 Updated 5 years ago
- Layerwise Batch Entropy Regularization ☆23 Updated 2 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32) ☆143 Updated 2 years ago
- A simple implementation of a deep linear Pytorch module ☆21 Updated 4 years ago
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space ☆41 Updated 4 years ago
- Another attempt at a long-context / efficient transformer by me ☆38 Updated 3 years ago