zeke-xie / deep-learning-dynamics-paper-list
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
☆249Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for deep-learning-dynamics-paper-list
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆141Updated last year
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆143Updated 2 weeks ago
- Welcome to the 'In Context Learning Theory' Reading Group☆22Updated this week
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆270Updated this week
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.☆318Updated 5 months ago
- Neural Tangent Kernel Papers☆92Updated 8 months ago
- ☆210Updated last year
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neur…☆33Updated 3 years ago
- Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)☆276Updated 11 months ago
- Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms☆262Updated last year
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Updated 2 years ago
- ☆37Updated last month
- ☆240Updated 6 months ago
- ☆59Updated 3 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆57Updated 9 months ago
- A lecture note for understanding deep learning☆188Updated 3 months ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆170Updated 9 months ago
- summer school materials☆45Updated last year
- a collection of AWESOME things about Optimal Transport in Deep Learning☆195Updated 5 months ago
- A simple code for plotting figure, colorbar, and cropping with python☆370Updated 2 years ago
- Solution and Useful Links☆37Updated 2 years ago
- ReduNet☆532Updated 2 years ago
- Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]☆207Updated 5 months ago
- Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)☆192Updated last year
- ☆179Updated 11 months ago
- Official implementation for Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models (ICML 2022), and a re…☆102Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆40Updated 6 months ago
- Collection of papers on state-space models☆549Updated this week
- ☆66Updated 5 years ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆73Updated 4 months ago