timlautk / BCD-for-DNNs-PyTorch
Code for Global Convergence of Block Coordinate Descent in Deep Learning (ICML 2019)
☆37 Updated 6 years ago
Alternatives and similar repositories for BCD-for-DNNs-PyTorch
Users who are interested in BCD-for-DNNs-PyTorch are comparing it to the libraries listed below.
- PyTorch implementation of FIM and empirical FIM ☆61 Updated 7 years ago
- LipSDP - Lipschitz Estimation for Neural Networks ☆71 Updated 3 years ago
- ☆91 Updated 3 years ago
- Code for the signSGD paper ☆91 Updated 4 years ago
- Implementation of SVRG and SAGA optimization algorithms for deep learning topics. ☆74 Updated 4 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms" ☆50 Updated 3 years ago
- SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847 ☆31 Updated last year
- ☆124 Updated last year
- Coresets via Bilevel Optimization ☆67 Updated 4 years ago
- Tilted Empirical Risk Minimization (ICLR '21) ☆60 Updated last year
- Code for the paper "Let’s Make Block Coordinate Descent Go Fast" ☆48 Updated 2 years ago
- Certifying Some Distributional Robustness with Principled Adversarial Training (https://arxiv.org/abs/1710.10571) ☆45 Updated 7 years ago
- Pytorch version of NIPS'16 "Learning to learn by gradient descent by gradient descent" ☆67 Updated 2 years ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot ☆41 Updated 4 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao… ☆15 Updated 3 years ago
- demonstration of the information bottleneck theory for deep learning ☆66 Updated 8 years ago
- PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators" ☆75 Updated 5 years ago
- ☆67 Updated 6 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH ☆105 Updated 5 years ago
- ☆154 Updated 5 years ago
- Lookahead: A Far-sighted Alternative of Magnitude-based Pruning (ICLR 2020) ☆32 Updated 5 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆43 Updated 6 years ago
- Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop) ☆44 Updated 3 years ago
- NTK reading group ☆87 Updated 5 years ago
- Difference-of-Entropies (DoE) Estimator ☆25 Updated 3 years ago
- Example Code for paper "Provably Faster Algorithms for Bilevel Optimization" ☆15 Updated 3 years ago
- Mixed integer programming for computing lipschitz constants of ReLU Networks ☆17 Updated 2 years ago
- ☆83 Updated 5 years ago
- Learning Sparse Neural Networks through L0 regularization ☆244 Updated 5 years ago
- Implementation of SVRG for training neural networks ☆23 Updated 5 years ago