shwinshaker / LipGrow
An adaptive training algorithm for residual networks
☆15 Updated 4 years ago
Alternatives and similar repositories for LipGrow
Users who are interested in LipGrow are comparing it to the libraries listed below.
- ☆12 Updated 2 years ago
- ☆22 Updated 2 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021 ☆27 Updated 3 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020) ☆21 Updated 2 years ago
- ☆16 Updated 2 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆46 Updated 5 years ago
- Code base for SRSGD. ☆28 Updated 5 years ago
- ☆36 Updated 4 years ago
- ☆41 Updated 2 years ago
- ☆20 Updated 5 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments" ☆15 Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models ☆45 Updated 2 years ago
- ☆25 Updated 5 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆22 Updated last year
- Implementation of Kronecker Attention in Pytorch ☆19 Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 3 years ago
- ☆30 Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness ☆14 Updated 4 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated 2 weeks ago
- ☆34 Updated 3 weeks ago
- ☆18 Updated 2 years ago
- Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier" ☆35 Updated 2 years ago
- Encodings for neural architecture search ☆29 Updated 4 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S… ☆25 Updated 3 years ago
- ☆36 Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 Updated 2 years ago
- ☆19 Updated 5 years ago
- Reproducible code for Augmentation paper ☆17 Updated 6 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020) ☆27 Updated 5 years ago
- Official PyTorch code release for Implicit Gradient Transport, NeurIPS'19 ☆21 Updated 6 years ago