thegregyang / NTK4ALinks
Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"
☆105Updated 5 years ago
Alternatives and similar repositories for NTK4A
Users that are interested in NTK4A are comparing it to the libraries listed below
Sorting:
- Convolutional Neural Tangent Kernel☆112Updated 5 years ago
- ☆67Updated 6 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated 3 weeks ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆176Updated 5 years ago
- NTK reading group☆87Updated 5 years ago
- Hessian spectral density estimation in TF and Jax☆124Updated 5 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆131Updated 6 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆147Updated 2 years ago
- paper lists and information on mean-field theory of deep learning☆78Updated 6 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆63Updated 4 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated 2 years ago
- ☆157Updated 3 years ago
- ☆124Updated last year
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆140Updated 6 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆76Updated last year
- ☆59Updated 2 years ago
- ☆100Updated 3 years ago
- Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes…☆247Updated 5 years ago
- ☆133Updated 4 years ago
- Monotone operator equilibrium networks☆53Updated 5 years ago
- ☆170Updated last year
- ☆36Updated 4 years ago
- Optimization with orthogonal constraints and on general manifolds☆130Updated 5 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 3 years ago
- Structured matrices for compressing neural networks☆67Updated 2 years ago
- ☆70Updated 9 months ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 6 years ago
- ☆30Updated 4 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 6 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆42Updated 6 years ago