xie-lab-ml / deep-learning-dynamics-paper-list
This is a list of representative peer-reviewed papers on deep learning dynamics (the optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essential role in the theoretical foundations of deep learning.
☆292Updated last year
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch…☆203Updated last year
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆150Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆383Updated last week
- Neural Tangent Kernel Papers☆120Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆30Updated last year
- ☆241Updated 3 years ago
- Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)☆357Updated 6 months ago
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.☆383Updated last year
- ☆56Updated last year
- summer school materials☆46Updated 2 years ago
- Summer course on mathematical theory of deep learning☆53Updated 6 years ago
- Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms☆294Updated 2 years ago
- a collection of AWESOME things about Optimal Transport in Deep Learning☆333Updated last year
- Template and style files for ICLR☆255Updated 4 months ago
- A list of awesome papers and cool resources on optimal transport and its applications in general! As you will notice, this list is curren…☆243Updated 4 years ago
- Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)☆202Updated 3 years ago
- ☆73Updated last year
- Code for reproducing results in the sliced score matching paper (UAI 2019)☆149Updated 6 years ago
- ☆50Updated 2 years ago
- ReduNet☆544Updated 3 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 4 years ago
- [ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆122Updated 6 months ago
- Solution and Useful Links☆64Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Updated 10 months ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Updated 2 years ago
- ☆241Updated last year
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆126Updated last year
- Collection of papers on state-space models☆614Updated 2 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆69Updated last year
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆52Updated last month