michaelrzhang / lookahead
Implementation for the Lookahead Optimizer.
☆241 Updated 3 years ago
Alternatives and similar repositories for lookahead
Users who are interested in lookahead are comparing it to the libraries listed below.
- PyTorch dataset extended with map, cache etc. (tensorflow.data like) ☆329 Updated 3 years ago
- Decoupled Weight Decay Regularization (ICLR 2019) ☆277 Updated 6 years ago
- Totally Versatile Miscellanea for Pytorch ☆472 Updated 3 years ago
- Implementations of ideas from recent papers ☆392 Updated 4 years ago
- Experimental ground for optimizing memory of pytorch models ☆366 Updated 7 years ago
- PyTorch functions and utilities to make your life easier ☆194 Updated 4 years ago
- Loss Patterns of Neural Networks ☆85 Updated 3 years ago
- Implementation of https://arxiv.org/abs/1904.00962 ☆376 Updated 4 years ago
- Gradient based Hyperparameter Tuning library in PyTorch ☆290 Updated 5 years ago
- Prescribed Generative Adversarial Networks ☆143 Updated 5 years ago
- ☆165 Updated 6 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆182 Updated 4 years ago
- [NeurIPS 2019] Deep Set Prediction Networks ☆100 Updated 4 years ago
- ☆144 Updated 2 years ago
- A Re-implementation of Fixed-update Initialization ☆155 Updated 6 years ago
- PyTorch Implementations of Dropout Variants ☆87 Updated 7 years ago
- hessian in pytorch ☆187 Updated 4 years ago
- Hypergradient descent ☆149 Updated last year
- Understanding Training Dynamics of Deep ReLU Networks ☆296 Updated 3 weeks ago
- Code for the paper: Putting An End to End-to-End: Gradient-Isolated Learning of Representations ☆285 Updated 2 years ago
- Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch. ☆110 Updated last week
- Estimates the size of a PyTorch model in memory ☆358 Updated 5 years ago
- Implements stochastic line search ☆118 Updated 2 years ago
- Code for experiments regarding importance sampling for training neural networks ☆329 Updated 3 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch ☆274 Updated 2 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks ☆83 Updated 3 years ago
- Robust Bi-Tempered Logistic Loss Based on Bregman Divergences. https://arxiv.org/pdf/1906.03361.pdf ☆147 Updated 3 years ago
- Utilities for Pytorch ☆88 Updated 2 years ago
- Memory efficient MAML using gradient checkpointing ☆85 Updated 5 years ago
- Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more! ☆377 Updated 2 years ago