This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
☆298Apr 10, 2024Updated 2 years ago
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆151Feb 17, 2023Updated 3 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆61Feb 3, 2024Updated 2 years ago
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆205Apr 13, 2026Updated 3 weeks ago
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- ☆75Dec 7, 2024Updated last year
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆783Jul 10, 2025Updated 10 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆65Mar 11, 2025Updated last year
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆20Jan 31, 2021Updated 5 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 4 months ago
- ☆14Oct 18, 2021Updated 4 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆400Apr 21, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆271Mar 14, 2026Updated last month
- ☆13Jul 2, 2025Updated 10 months ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆39Feb 17, 2025Updated last year
- ☆35Dec 5, 2022Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆61Jan 14, 2025Updated last year
- DELT: Data Efficacy for Language Model Training☆45Feb 12, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Awesome papers in machine learning theory☆10Feb 12, 2022Updated 4 years ago
- Experiments on trade-off among optimization, generalization and conflict aversion in multi-objective learning (MOL), and introducing MoDo…☆15Oct 21, 2023Updated 2 years ago
- Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory"☆12Apr 7, 2026Updated last month
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- ICLR 2021: Noise against noise: stochastic label noise helps combat inherent label noise☆15May 1, 2021Updated 5 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆174Nov 10, 2019Updated 6 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆26Feb 2, 2023Updated 3 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆23Nov 1, 2022Updated 3 years ago
- Official implementation of the paper "Neural Hamilton: Can A.I. Understand Hamiltonian Mechanics?"☆14Feb 9, 2026Updated 3 months ago
- A collection of research papers on low-precision training methods☆66May 10, 2025Updated 11 months ago
- Finetune Google's pre-trained ViT models from HuggingFace's model hub.☆19Apr 4, 2021Updated 5 years ago
- Library for computing the Finite-time Lyapunov Exponents of 2D flows using xarray☆10Apr 25, 2022Updated 4 years ago
- SE-PINN: Solving the Schrödinger Equation via Physics-Informed Machine Learning☆11Dec 17, 2025Updated 4 months ago
- Code for 'Periodic Activation Functions Induce Stationarity' (NeurIPS 2021)☆19Oct 27, 2021Updated 4 years ago