This is a list of representative peer-reviewed papers on deep learning dynamics (the optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and stochastic optimization. Deep learning dynamics therefore play an essential role in the theoretical foundations of deep learning.
☆299 · Apr 10, 2024 · Updated 2 years ago
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below.
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise… ☆150 · Feb 17, 2023 · Updated 3 years ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers. ☆28 · Aug 30, 2022 · Updated 3 years ago
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur… ☆33 · Aug 3, 2021 · Updated 4 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch… ☆211 · Apr 13, 2026 · Updated last month
- Neural Tangent Kernel Papers ☆122 · Jan 12, 2025 · Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx… ☆12 · Oct 17, 2022 · Updated 3 years ago
- Welcome to the 'In Context Learning Theory' Reading Group ☆31 · Nov 8, 2024 · Updated last year
- ScalingOpt - Optimization Community ☆98 · May 23, 2026 · Updated last week
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022). ☆15 · Dec 13, 2022 · Updated 3 years ago
- ☆76 · Dec 7, 2024 · Updated last year
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks ☆783 · Jul 10, 2025 · Updated 10 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆65 · Mar 11, 2025 · Updated last year
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning" ☆20 · Jan 31, 2021 · Updated 5 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation ☆33 · Dec 22, 2025 · Updated 5 months ago
- ☆14 · Oct 18, 2021 · Updated 4 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating... ☆401 · May 19, 2026 · Updated last week
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods. ☆19 · Feb 13, 2023 · Updated 3 years ago
- ☆271 · Mar 14, 2026 · Updated 2 months ago
- ☆13 · Jul 2, 2025 · Updated 10 months ago
- Efficient empirical NTKs in PyTorch ☆22 · Jun 13, 2022 · Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime ☆23 · Jul 25, 2024 · Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis ☆39 · Feb 17, 2025 · Updated last year
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization" ☆12 · Aug 10, 2022 · Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆36 · Apr 7, 2025 · Updated last year
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset… ☆62 · Jan 14, 2025 · Updated last year
- ☆88 · Jul 18, 2023 · Updated 2 years ago
- Awesome papers in machine learning theory ☆10 · Feb 12, 2022 · Updated 4 years ago
- Experiments on trade-off among optimization, generalization and conflict aversion in multi-objective learning (MOL), and introducing MoDo… ☆15 · Oct 21, 2023 · Updated 2 years ago
- ☆18 · Jan 17, 2024 · Updated 2 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt… ☆30 · Oct 21, 2024 · Updated last year
- Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory" ☆12 · Apr 7, 2026 · Updated last month
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆20 · Jan 7, 2022 · Updated 4 years ago
- ICLR 2021: Noise against noise: stochastic label noise helps combat inherent label noise ☆15 · May 1, 2021 · Updated 5 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK ☆173 · Nov 10, 2019 · Updated 6 years ago
- ☆26 · Feb 2, 2023 · Updated 3 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer ☆16 · Nov 21, 2024 · Updated last year
- ☆23 · Nov 1, 2022 · Updated 3 years ago
- Deep Learning Theory and Practice ☆25 · Dec 5, 2023 · Updated 2 years ago
- Sort out the researchers in the field of AI for Science ☆20 · Apr 5, 2023 · Updated 3 years ago