shwinshaker / LipGrow
An adaptive training algorithm for residual network
☆15Updated 4 years ago
Alternatives and similar repositories for LipGrow:
Users that are interested in LipGrow are comparing it to the libraries listed below
- ☆13Updated 2 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆27Updated 3 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)☆21Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆44Updated 2 years ago
- ☆41Updated 2 years ago
- Code base for SRSGD.☆28Updated 5 years ago
- ☆36Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated 2 years ago
- ☆22Updated 2 years ago
- ☆30Updated 4 years ago
- ☆20Updated 4 years ago
- ☆11Updated 2 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆45Updated 5 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆62Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Updated last year
- Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.☆55Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Reproducible code for Augmentation paper☆17Updated 6 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Updated 2 years ago
- Winning Solution of the NeurIPS 2020 Competition on Predicting Generalization in Deep Learning☆40Updated 4 years ago
- ☆31Updated 4 years ago
- ☆26Updated 3 years ago
- ☆16Updated last year
- Implementation of Kronecker Attention in Pytorch☆18Updated 4 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 2 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Updated 4 years ago
- ☆33Updated 4 years ago