This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
☆294Apr 10, 2024Updated last year
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆149Feb 17, 2023Updated 3 years ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Aug 30, 2022Updated 3 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆62Feb 3, 2024Updated 2 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆207Dec 27, 2024Updated last year
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- ScalingOpt - Optimization Community☆83Mar 22, 2026Updated last week
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- ☆74Dec 7, 2024Updated last year
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆777Jul 10, 2025Updated 8 months ago
- ☆20Jan 4, 2023Updated 3 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 3 months ago
- ☆14Oct 18, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆395Jan 7, 2026Updated 2 months ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- ☆13Jul 2, 2025Updated 8 months ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- ☆35Dec 5, 2022Updated 3 years ago
- ☆27Jun 12, 2025Updated 9 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆61Jan 14, 2025Updated last year
- ☆88Jul 18, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- SOLNP+: A derivative-free optimization software☆24May 13, 2025Updated 10 months ago
- Awesome papers in machine learning theory☆10Feb 12, 2022Updated 4 years ago
- Experiments on trade-off among optimization, generalization and conflict aversion in multi-objective learning (MOL), and introducing MoDo…☆15Oct 21, 2023Updated 2 years ago
- Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory"☆12Jan 30, 2025Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- ICLR 2021: Noise against noise: stochastic label noise helps combat inherent label noise☆15May 1, 2021Updated 4 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆173Nov 10, 2019Updated 6 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆26Feb 2, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- ☆23Nov 1, 2022Updated 3 years ago
- Official implementation of the paper "Neural Hamilton: Can A.I. Understand Hamiltonian Mechanics?"☆14Feb 9, 2026Updated last month
- ☆35Jun 13, 2023Updated 2 years ago
- Sort out the researchers in the field of AI for Science☆19Apr 5, 2023Updated 2 years ago
- Finetune Google's pre-trained ViT models from HuggingFace's model hub.☆19Apr 4, 2021Updated 4 years ago
- Library for computing the Finite-time Lyapunov Exponents of 2D flows using xarray☆10Apr 25, 2022Updated 3 years ago