This is a list of representative peer-reviewed papers on deep learning dynamics (the optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essential role in the theoretical foundations of deep learning.
☆301Apr 10, 2024Updated 2 years ago
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below.
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆150Feb 17, 2023Updated 3 years ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆28Aug 30, 2022Updated 3 years ago
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur…☆33Aug 3, 2021Updated 4 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆61Feb 3, 2024Updated 2 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch…☆211Apr 13, 2026Updated 2 months ago
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- ScalingOpt - Optimization Community☆100Jun 1, 2026Updated 2 weeks ago
- ☆76Dec 7, 2024Updated last year
- PyHessian is a PyTorch library for second-order based analysis and training of Neural Networks☆787Jul 10, 2025Updated 11 months ago
- ☆20Jan 4, 2023Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆65Mar 11, 2025Updated last year
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆20Jan 31, 2021Updated 5 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆402May 29, 2026Updated 2 weeks ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- ☆271Mar 14, 2026Updated 3 months ago
- ☆13Jul 2, 2025Updated 11 months ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 4 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization"☆12Aug 10, 2022Updated 3 years ago
- ☆35Dec 5, 2022Updated 3 years ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆62Jan 14, 2025Updated last year
- ☆90Jul 18, 2023Updated 2 years ago
- Awesome papers in machine learning theory☆10Feb 12, 2022Updated 4 years ago
- Experiments on trade-off among optimization, generalization and conflict aversion in multi-objective learning (MOL), and introducing MoDo…☆15Oct 21, 2023Updated 2 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- ☆28Jun 12, 2025Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆173Nov 10, 2019Updated 6 years ago
- ☆26Feb 2, 2023Updated 3 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- ☆35Jun 13, 2023Updated 3 years ago
- Finetune Google's pre-trained ViT models from HuggingFace's model hub.☆19Apr 4, 2021Updated 5 years ago
- Library for computing the Finite-time Lyapunov Exponents of 2D flows using xarray☆10Apr 25, 2022Updated 4 years ago
- SE-PINN: Solving the Schrödinger Equation via Physics-Informed Machine Learning☆11Dec 17, 2025Updated 6 months ago
- Code for 'Periodic Activation Functions Induce Stationarity' (NeurIPS 2021)☆20Oct 27, 2021Updated 4 years ago
- ☆198Jun 9, 2026Updated last week