IssamLaradji / spsLinks
Official code for the Stochastic Polyak step-size optimizer
☆139Updated last year
Alternatives and similar repositories for sps
Users that are interested in sps are comparing it to the libraries listed below
Sorting:
- Implements stochastic line search☆118Updated 2 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆108Updated last year
- 🧀 Pytorch code for the Fromage optimiser.☆129Updated last year
- 👩 Pytorch and Jax code for the Madam optimiser.☆53Updated 4 years ago
- Codebase for Learning Invariances in Neural Networks☆96Updated 3 years ago
- ☆133Updated 4 years ago
- ☆79Updated 5 years ago
- ☆100Updated 3 years ago
- Loss Patterns of Neural Networks☆86Updated 4 years ago
- Python implementation of GLN in different frameworks☆97Updated 5 years ago
- Bayesianize: A Bayesian neural network wrapper in pytorch☆89Updated last year
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆84Updated 3 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆259Updated 2 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- ☆153Updated 5 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated 2 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated last year
- My implementation of DeepMind's Perceiver☆63Updated 4 years ago
- Code for "Supermasks in Superposition"☆124Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆115Updated 3 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …☆66Updated 5 years ago
- Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes…☆247Updated 5 years ago
- Code for: Implicit Competitive Regularization in GANs☆115Updated 3 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆128Updated last year
- Quick modules to turn regular Neural Networks to Bayesian Neural Networks with Dropout.☆35Updated 4 years ago
- Prescribed Generative Adversarial Networks☆143Updated 5 years ago
- Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.☆110Updated 2 months ago
- Gradient based Hyperparameter Tuning library in PyTorch☆290Updated 5 years ago