mknbv / adashift
AdaShift optimizer implementation in PyTorch
☆17Updated 6 years ago
Alternatives and similar repositories for adashift:
Users that are interested in adashift are comparing it to the libraries listed below
- Skoltech 2017 NLA course☆37Updated 6 years ago
- Learning to Initialize Neural Networks for Stable and Efficient Training☆139Updated 2 years ago
- FLOPs and other statistics COunter for Pytorch neural networks☆23Updated 3 years ago
- MUSCO: MUlti-Stage COmpression of neural networks☆72Updated 4 years ago
- Greedy Bayesian Posterior Approximation with Deep Ensembles. A. Tiulpin and M. B. Blaschko. (2021)☆11Updated 2 years ago
- Theoretical Deep Learning: generalization ability☆46Updated 5 years ago
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more!☆51Updated 5 years ago
- On the New method of Hessian-free second-order optimization☆8Updated 4 years ago
- model-in-the-loop☆42Updated 5 years ago
- A fork of the official TPU models repo with fixes and a solution of the Kaggle Open Images 2019 Object Detection Challenge☆49Updated 5 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆83Updated 3 years ago
- ☆47Updated 4 years ago
- Modification of PyTorch implementation of YOLOv3 Object Detection.☆17Updated 5 years ago
- Deep Generative Models course, 2021☆22Updated 3 years ago
- Simple implementation of the LSUV initialization in PyTorch☆58Updated last year
- Very simple and short implementation of gradient boosting in 18 lines of code☆9Updated 4 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago
- Presentations of the advanced topics in optimization☆11Updated 5 years ago
- ☆66Updated 7 years ago
- Course "Theories of Deep Learning"☆196Updated 5 years ago
- The Deep Weight Prior, ICLR 2019☆45Updated 4 years ago
- NLA 2018 Skoltech course☆55Updated 6 years ago
- Uncertainty Estimation via Stochastic Batch Normalization☆20Updated 7 years ago
- An implementation of shampoo☆74Updated 7 years ago
- Implementations of quasi-hyperbolic optimization algorithms.☆102Updated 4 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 5 years ago
- Compression of NMT transformer model with tensor methods☆48Updated 5 years ago
- ☆21Updated 2 years ago
- custom cuda kernel for {2, 3}d relative attention with pytorch wrapper☆43Updated 5 years ago
- PyTorch implementation of "SRM : A Style-based Recalibration Module for Convolutional Neural Networks"☆81Updated 5 years ago