xie-lab-ml / deep-learning-dynamics-paper-listLinks
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
☆278Updated last year
Alternatives and similar repositories for deep-learning-dynamics-paper-list
Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below
Sorting:
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆185Updated 6 months ago
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆150Updated 2 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆28Updated 8 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆330Updated this week
- ☆230Updated 2 years ago
- Neural Tangent Kernel Papers☆115Updated 6 months ago
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.☆365Updated last year
- summer school materials☆44Updated last year
- ☆50Updated 9 months ago
- Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)☆313Updated this week
- Summer course on mathematical theory of deep learning☆52Updated 5 years ago
- a collection of AWESOME things about Optimal Transport in Deep Learning☆286Updated last year
- Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms☆281Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆59Updated 4 months ago
- ☆70Updated 7 months ago
- ☆259Updated 4 months ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆49Updated 3 years ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆105Updated last week
- Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)☆199Updated 2 years ago
- Collection of papers on state-space models☆594Updated 2 months ago
- Code for reproducing results in the sliced score matching paper (UAI 2019)☆147Updated 5 years ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆110Updated 7 months ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆198Updated last year
- A Telegram bot to recommend arXiv papers☆276Updated 3 months ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆37Updated last month
- Mutual Information Neural Estimation in Pytorch☆329Updated 9 months ago
- Deep Learning Theory course☆25Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆20Updated 11 months ago
- Processed / Cleaned Data for Paper Copilot☆520Updated 3 weeks ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆18Updated 2 years ago