xie-lab-ml / deep-learning-dynamics-paper-listLinks

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.

☆281

Alternatives and similar repositories for deep-learning-dynamics-paper-list

Users that are interested in deep-learning-dynamics-paper-list are comparing it to the libraries listed below

Sorting:

WeiHuang05 / Awesome-Feature-Learning-in-Deep-Learning-Thoery
Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…
☆189Updated 7 months ago
MinghuiChen43 / awesome-deep-phenomena
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
☆337Updated 2 weeks ago
zeke-xie / adaptive-inertia-adai
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…
☆150Updated 2 years ago
WeiHuang05 / Awesome_Large_Foundation_Model_Theory
Welcome to the 'In Context Learning Theory' Reading Group
☆29Updated 8 months ago
yataobian / awesome-ebm
Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)
☆323Updated 3 weeks ago
kwignb / NeuralTangentKernel-Papers
Neural Tangent Kernel Papers
☆115Updated 6 months ago
tengyuma / cs229m_notes
☆232Updated 2 years ago
epfml / optML-pku
summer school materials
☆44Updated 2 years ago
ZIYU-DEEP / Awesome-Information-Bottleneck
This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.
☆366Updated last year
leiwu0 / course.math_theory_nn
Summer course on mathematical theory of deep learning
☆53Updated 6 years ago
VITA-Group / Open-L2O
Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms
☆281Updated 2 years ago
miniHuiHui / awesome-high-order-neural-network
☆50Updated 10 months ago
changwxx / Awesome-Optimal-Transport-in-Deep-Learning
a collection of AWESOME things about Optimal Transport in Deep Learning
☆294Updated last year
ryanchankh / mcr2
Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)
☆200Updated 2 years ago
foocker / deeplearningtheory
☆260Updated 4 months ago
zyushun / hessian-spectrum
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
☆60Updated 4 months ago
thuwzy / ZhuSuan-PyTorch
An Elegant Library for Bayesian Deep Learning in PyTorch
☆26Updated 2 years ago
Kaffaljidhmah2 / Arxiv-Recommender
☆52Updated last year
andyjm3 / Awesome-Riemannian-Optimization
This repo contains papers, books, tutorials and resources on Riemannian optimization.
☆37Updated 2 months ago
Ma-Lab-Berkeley / ReduNet
ReduNet
☆539Updated 3 years ago
ermongroup / sliced_score_matching
Code for reproducing results in the sliced score matching paper (UAI 2019)
☆147Updated 5 years ago
ZO-Bench / ZO-LLM
[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆109Updated 3 weeks ago
kilianFatras / awesome-optimal-transport
A list of awesome papers and cool resources on optimal transport and its applications in general! As you will notice, this list is curren…
☆229Updated 4 years ago
locuslab / edge-of-stability
☆70Updated 7 months ago
neuralcollapse / neuralcollapse
Code reproducing Neural Collapse phenomenon on MSE and cross-entropy loss
☆14Updated 3 years ago
omihub777 / ViT-CIFAR
PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…
☆198Updated last year
xuzhiqin1990 / understanding_dl
A lecture note for understanding deep learning
☆346Updated last month
ICLR / Master-Template
Template and style files for ICLR
☆215Updated last month
alisiahkoohi / Langevin-dynamics
Sampling with gradient-based Markov Chain Monte Carlo approaches
☆105Updated last year
sail-sg / stde
Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024
☆113Updated 8 months ago