This is a list of representative peer-reviewed papers on deep learning dynamics (the optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essential role in the theoretical foundations of deep learning.
☆294Apr 10, 2024Updated last year
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users who are interested in deep-learning-dynamics-paper-list are comparing it to the repositories listed below
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Aug 30, 2022Updated 3 years ago
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur…☆33Aug 3, 2021Updated 4 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch…☆206Dec 27, 2024Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆62Feb 3, 2024Updated 2 years ago
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- ☆74Dec 7, 2024Updated last year
- ☆20Jan 4, 2023Updated 3 years ago
- PyHessian is a PyTorch library for second-order based analysis and training of neural networks☆778Jul 10, 2025Updated 8 months ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- ☆14Oct 18, 2021Updated 4 years ago
- A curated list of papers with interesting empirical studies and insights on deep learning. Continually updated...☆392Jan 7, 2026Updated 2 months ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- ☆26Jun 12, 2025Updated 8 months ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Mar 11, 2025Updated 11 months ago
- Implementation of Beyond Neural Scaling beating power laws for deep models and prototype-based models☆34Oct 30, 2025Updated 4 months ago
- ☆23Nov 1, 2022Updated 3 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆20Jan 31, 2021Updated 5 years ago
- ☆26Feb 2, 2023Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- The source code for the ICLR'24 paper "Stabilizing Backpropagation Through Time to Learn Complex Physics"☆11May 17, 2024Updated last year
- ☆44Oct 30, 2025Updated 4 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Official Implementation of SWAD (NeurIPS 2021)☆170Dec 10, 2022Updated 3 years ago
- A Python tool that helps you interact with ChatGPT.☆10Dec 11, 2022Updated 3 years ago
- ☆10Mar 18, 2023Updated 2 years ago
- Library for computing the Finite-time Lyapunov Exponents of 2D flows using xarray☆10Apr 25, 2022Updated 3 years ago
- Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)☆1,618Feb 1, 2024Updated 2 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆173Nov 10, 2019Updated 6 years ago
- ☆15Feb 22, 2018Updated 8 years ago
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆16Sep 4, 2024Updated last year
- Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory"☆11Jan 30, 2025Updated last year
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆61Jan 14, 2025Updated last year
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- Quantification of Uncertainties in Neural Networks☆11Feb 25, 2026Updated last week
- ☆14Mar 4, 2022Updated 4 years ago