minhtannguyen / SRSGD
Code base for SRSGD.
☆28 Updated 4 years ago
Alternatives and similar repositories for SRSGD:
Users who are interested in SRSGD are comparing it to the repositories listed below:
- A PyTorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆44 Updated 4 years ago
- This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima". ☆14 Updated 5 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness ☆14 Updated 4 years ago
- ☆31 Updated 4 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆33 Updated 4 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning ☆33 Updated 4 years ago
- Code for paper "Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow" ☆19 Updated 4 years ago
- ☆15 Updated last year
- ☆19 Updated 5 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 3 years ago
- ☆29 Updated 4 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020 ☆14 Updated 4 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020) ☆21 Updated 2 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019) ☆28 Updated 5 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020. ☆13 Updated 4 years ago
- An adaptive training algorithm for residual network ☆15 Updated 4 years ago
- Limitations of the Empirical Fisher Approximation ☆47 Updated 4 years ago
- MintNet: Building Invertible Neural Networks with Masked Convolutions ☆39 Updated 4 years ago
- ICML 2020, Estimating Generalization under Distribution Shifts via Domain-Invariant Representations ☆22 Updated 4 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561 ☆24 Updated 3 years ago
- CIFAR-5m dataset ☆38 Updated 4 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning ☆30 Updated 3 years ago
- ☆21 Updated 4 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning" ☆10 Updated 4 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep … ☆39 Updated last year
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749 ☆26 Updated 4 years ago
- [ICLR 2020] “Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference” ☆24 Updated 3 years ago
- This repository is no longer maintained. Check ☆81 Updated 4 years ago
- Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent ☆13 Updated 4 years ago
- Sliced Wasserstein Generator ☆23 Updated 6 years ago