timlautk / BCD-for-DNNs-PyTorch
Code for Global Convergence of Block Coordinate Descent in Deep Learning (ICML 2019)
☆37 Updated 6 years ago
Alternatives and similar repositories for BCD-for-DNNs-PyTorch
Users who are interested in BCD-for-DNNs-PyTorch are comparing it to the libraries listed below.
- ☆59 Updated 2 years ago
- ☆124 Updated last year
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms" ☆49 Updated 3 years ago
- PyTorch implementation of FIM and empirical FIM ☆61 Updated 7 years ago
- Code for the signSGD paper ☆90 Updated 4 years ago
- ☆91 Updated 3 years ago
- Implementation of SVRG for training neural networks ☆22 Updated 5 years ago
- Implementation of SVRG and SAGA optimization algorithms for deep learning topics. ☆74 Updated 4 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao… ☆15 Updated 3 years ago
- Tilted Empirical Risk Minimization (ICLR '21) ☆60 Updated last year
- Difference-of-Entropies (DoE) Estimator ☆25 Updated 3 years ago
- PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators" ☆75 Updated 5 years ago
- ☆83 Updated 5 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆42 Updated 6 years ago
- Compressing Neural Networks using the Variational Information Bottleneck ☆66 Updated 3 years ago
- LipSDP - Lipschitz Estimation for Neural Networks ☆70 Updated 3 years ago
- Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop) ☆44 Updated 2 years ago
- demonstration of the information bottleneck theory for deep learning ☆65 Updated 7 years ago
- Pytorch Implementation of the Nonlinear Information Bottleneck ☆39 Updated last year
- SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847 ☆31 Updated last year
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH ☆105 Updated 5 years ago
- Code for the paper "Let’s Make Block Coordinate Descent Go Fast" ☆48 Updated 2 years ago
- Example Code for paper "Provably Faster Algorithms for Bilevel Optimization" ☆15 Updated 3 years ago
- NTK reading group ☆87 Updated 5 years ago
- Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571) ☆45 Updated 7 years ago
- ☆67 Updated 6 years ago
- Pytorch version of NIPS'16 "Learning to learn by gradient descent by gradient descent" ☆67 Updated 2 years ago
- The HSIC Bottleneck: Deep Learning without Back-Propagation ☆88 Updated 4 years ago
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop. ☆16 Updated 3 years ago
- Distributional and Outlier Robust Optimization (ICML 2021) ☆27 Updated 4 years ago