timlautk / BCD-for-DNNs-PyTorch
Code for Global Convergence of Block Coordinate Descent in Deep Learning (ICML 2019)
☆35Updated 5 years ago
Alternatives and similar repositories for BCD-for-DNNs-PyTorch:
Users that are interested in BCD-for-DNNs-PyTorch are comparing it to the libraries listed below
- ☆58Updated last year
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆37Updated 5 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆47Updated 3 years ago
- Code for the signSGD paper☆83Updated 4 years ago
- Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571)☆45Updated 6 years ago
- Implementation of SVRG for training neural networks☆21Updated 5 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆12Updated last year
- Implementation of SVRG and SAGA optimization algorithms for deep learning topics.☆70Updated 4 years ago
- Difference-of-Entropies (DoE) Estimator☆25Updated 2 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…☆15Updated 3 years ago
- Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint☆28Updated 2 years ago
- ☆121Updated 8 months ago
- Pytorch version of NIPS'16 "Learning to learn by gradient descent by gradient descent"☆64Updated last year
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.☆16Updated 3 years ago
- ☆89Updated 3 years ago
- ☆15Updated 3 years ago
- PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"☆74Updated 5 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Updated 5 years ago
- Coresets via Bilevel Optimization☆65Updated 4 years ago
- PyTorch implementation of FIM and empirical FIM☆58Updated 6 years ago
- PyTorch implementation of efficient algorithms for DRO with CVaR and Chi-Square uncertainty sets☆58Updated 2 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 4 years ago
- demonstration of the information bottleneck theory for deep learning☆61Updated 7 years ago
- Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop)☆42Updated 2 years ago
- Implementation of an efficient variant of SVRG that relies on mini-batching implemented in Pytorch☆26Updated 6 years ago
- ☆82Updated 5 years ago
- SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847☆31Updated 6 months ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- ☆31Updated 2 years ago
- Code for the paper "Let’s Make Block Coordinate Descent Go Fast"☆46Updated last year