zeke-xie / adaptive-inertia-adai
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum".
☆150 · Feb 17, 2023 · Updated 3 years ago
Alternatives and similar repositories for adaptive-inertia-adai
Users who are interested in adaptive-inertia-adai are comparing it to the libraries listed below.
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur… ☆33 · Aug 3, 2021 · Updated 4 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay. ☆62 · Feb 3, 2024 · Updated 2 years ago
- Official Code for ICML 2024 paper "TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision" ☆18 · Nov 18, 2024 · Updated last year
- ☆11 · Oct 20, 2023 · Updated 2 years ago
- Tacotron2 with BERT examples ☆10 · Jul 8, 2019 · Updated 6 years ago
- Spectral Graph Attention Network with Fast Eigen-approximation ☆12 · Dec 24, 2021 · Updated 4 years ago
- A study of the downstream instability of word embeddings ☆12 · Aug 23, 2022 · Updated 3 years ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25) ☆25 · Jun 5, 2025 · Updated 8 months ago
- ☆16 · Dec 7, 2025 · Updated 2 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆10 · Feb 27, 2024 · Updated last year
- Dataset and Baselines for "You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization pr… ☆11 · Sep 15, 2023 · Updated 2 years ago
- ☆11 · Apr 5, 2021 · Updated 4 years ago
- TVMScript kernel for deformable attention ☆25 · Dec 15, 2021 · Updated 4 years ago
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models ☆807 · Jun 8, 2025 · Updated 8 months ago
- Official implementation of the ICML 2020 paper "PDO-eConvs: Partial Differential Operator Based Equivariant Convolutions". ☆14 · Jun 2, 2021 · Updated 4 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆17 · Jun 3, 2024 · Updated last year
- G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021) ☆29 · Jan 11, 2022 · Updated 4 years ago
- [ICML 2024] Recurrent Distance Filtering for Graph Representation Learning ☆15 · Jun 10, 2024 · Updated last year
- ☆35 · Dec 5, 2022 · Updated 3 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch. ☆12 · Jun 26, 2024 · Updated last year
- ☆16 · Dec 13, 2022 · Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers. ☆20 · Oct 10, 2022 · Updated 3 years ago
- ☆22 · Dec 8, 2021 · Updated 4 years ago
- Delta Orthogonal Initialization for PyTorch ☆18 · Jun 27, 2018 · Updated 7 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆207 · Apr 24, 2024 · Updated last year
- Discovering Conservation Laws using Optimal Transport and Manifold Learning ☆22 · Sep 23, 2023 · Updated 2 years ago
- A GPU performance profiling tool for PyTorch models ☆22 · Jul 5, 2022 · Updated 3 years ago
- The source code of the paper "Understanding Graph Neural Networks from Graph Signal Denoising Perspectives" ☆23 · Jun 9, 2020 · Updated 5 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616 ☆133 · Jul 6, 2023 · Updated 2 years ago
- Interacting with Latent Space of AutoEncoder ☆21 · Nov 22, 2022 · Updated 3 years ago
- Dataset generation scripts for “Path Planning using Neural A* Search” presented in ICML-21 ☆24 · Jan 30, 2023 · Updated 3 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23 ☆23 · Apr 25, 2023 · Updated 2 years ago
- [CVPR 2020] Novel Object Viewpoint Estimation through Reconstruction Alignment ☆24 · Jun 7, 2020 · Updated 5 years ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning ☆283 · Feb 27, 2023 · Updated 2 years ago
- ☆26 · Apr 26, 2024 · Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆25 · Mar 7, 2024 · Updated last year
- ☆23 · Jun 15, 2022 · Updated 3 years ago
- Reinforcement Learning via Latent State Decoding ☆29 · Jun 12, 2023 · Updated 2 years ago
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks ☆775 · Jul 10, 2025 · Updated 7 months ago