zeke-xie / adaptive-inertia-adaiLinks
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum".
☆151Updated 2 years ago
Alternatives and similar repositories for adaptive-inertia-adai
Users that are interested in adaptive-inertia-adai are comparing it to the libraries listed below
Sorting:
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆289Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆60Updated last year
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆28Updated 3 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆85Updated 2 years ago
- ☆84Updated 4 years ago
- Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]☆231Updated 2 months ago
- Official implementation for Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models (ICML 2022), and a re…☆109Updated 3 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆116Updated 2 years ago
- Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)☆201Updated 2 years ago
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆505Updated last year
- The pure and clear PyTorch Distributed Training Framework.☆274Updated last year
- code to show F-Principle in the DNN training☆59Updated 3 years ago
- [ICLR 2021 top 3%] Is Attention Better Than Matrix Decomposition?☆339Updated 2 years ago
- ☆262Updated 6 months ago
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆174Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 4 months ago
- ReduNet☆539Updated 3 years ago
- ☆197Updated last year
- [NeurIPS 2022] A novel 1-Lipschitz network that can be efficiently trained to achieve certified L-infinity robustness for free!☆31Updated 2 years ago
- A library for calculating the FLOPs in the forward() process based on torch.fx☆127Updated 5 months ago
- PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.☆71Updated 2 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆144Updated 3 years ago
- Simple tutorials on Pytorch DDP training☆281Updated 3 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆369Updated last year
- Code for "On the Spectral Bias of Neural Networks", to appear in ICML 2019 (Long Beach, CA).☆111Updated 6 years ago
- Reproduce CKA: Similarity of Neural Network Representations Revisited☆311Updated 5 years ago
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur…☆33Updated 4 years ago
- Official PyTorch Implementation for Fast Adaptive Multitask Optimization (FAMO)☆97Updated last year
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆218Updated 4 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆81Updated last year