xie-lab-ml / deep-learning-dynamics-paper-listView external linksLinks
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
☆293Apr 10, 2024Updated last year
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below
Sorting:
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆149Feb 17, 2023Updated 2 years ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Aug 30, 2022Updated 3 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆205Dec 27, 2024Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆62Feb 3, 2024Updated 2 years ago
- Neural Tangent Kernel Papers☆121Jan 12, 2025Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- ☆73Dec 7, 2024Updated last year
- ☆20Jan 4, 2023Updated 3 years ago
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆775Jul 10, 2025Updated 7 months ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- ScalingOpt - Optimization Community☆78Feb 4, 2026Updated last week
- ☆14Oct 18, 2021Updated 4 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆390Jan 7, 2026Updated last month
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- ☆24Jun 12, 2025Updated 8 months ago
- ☆87Jul 18, 2023Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Mar 11, 2025Updated 11 months ago
- ☆23Nov 1, 2022Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- ☆26Feb 2, 2023Updated 3 years ago
- The source code the for the ICLR'24 paper "Stabilizing Backpropagation Through Time to Learn Complex Physics"☆11May 17, 2024Updated last year
- ☆44Oct 30, 2025Updated 3 months ago
- Official Implementation of SWAD (NeurIPS 2021)☆171Dec 10, 2022Updated 3 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated last month
- Library for computing the Finite-time Lyapunov Exponents of 2D flows using xarray☆10Apr 25, 2022Updated 3 years ago
- Spectral Graph Attention Network with Fast Eigen-approximation☆12Dec 24, 2021Updated 4 years ago
- ☆10Mar 18, 2023Updated 2 years ago
- ☆24Feb 18, 2021Updated 4 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆173Nov 10, 2019Updated 6 years ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆60Jan 14, 2025Updated last year
- ☆13Jul 2, 2025Updated 7 months ago
- ☆15Feb 22, 2018Updated 7 years ago
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆16Sep 4, 2024Updated last year
- Quantification of Uncertainties in Neural Networks☆11Nov 11, 2025Updated 3 months ago
- [ICML 2025] Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts☆24Nov 10, 2025Updated 3 months ago
- Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory"☆11Jan 30, 2025Updated last year
- Double Descent results for FCNNs on MNIST, extended by Label Noise (Reconciling Modern Machine-Learning Practice and the Classical Bias–V…☆13Oct 24, 2023Updated 2 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain☆11Jul 14, 2020Updated 5 years ago