edwardjhu / TP4
Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)
☆62 Updated 3 years ago
Alternatives and similar repositories for TP4
Users who are interested in TP4 are comparing it to the repositories listed below:
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆106 Updated 4 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆58 Updated last year
- Hessian spectral density estimation in TF and JAX ☆123 Updated 4 years ago
- ☆99 Updated 3 years ago
- ☆29 Updated 4 years ago
- A centralized place for deep thinking code and experiments ☆83 Updated last year
- ☆67 Updated 5 months ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 Updated 3 years ago
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆78 Updated 2 years ago
- Silly Twitter torch implementations. ☆46 Updated 2 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020) ☆28 Updated 4 years ago
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆174 Updated this week
- PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆36 Updated 3 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆36 Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… ☆67 Updated 2 years ago
- ☆60 Updated 3 years ago
- ☆35 Updated last year
- ☆64 Updated 2 years ago
- [NeurIPS'19] Deep Equilibrium Models JAX Implementation ☆39 Updated 4 years ago
- ☆66 Updated 6 years ago
- Open source code for EigenGame. ☆30 Updated last year
- ☆52 Updated 7 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆109 Updated 2 years ago
- PyTorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 Updated 5 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS '22) ☆16 Updated 2 years ago
- ☆37 Updated 3 years ago
- Structured matrices for compressing neural networks ☆66 Updated last year
- This repository contains the Julia code for the paper "Competitive Gradient Descent" ☆24 Updated 5 years ago
- JAX/Flax rewrite of Karpathy's nanoGPT ☆57 Updated 2 years ago