teddykoker / grokking
PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆31 Updated 2 years ago
Related projects:
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆55 Updated 2 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube ☆47 Updated 5 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522) ☆57 Updated 3 years ago
- ☆35 Updated 2 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆93 Updated 4 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated last year
- ☆65 Updated 5 years ago
- Hessian spectral density estimation in TF and JAX ☆112 Updated 4 years ago
- PyTorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner … ☆105 Updated this week
- codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks" ☆49 Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆58 Updated 2 years ago
- ☆21 Updated last year
- Official code for UnICORNN (ICML 2021) ☆27 Updated 2 years ago
- [NeurIPS'19] Deep Equilibrium Models JAX Implementation ☆34 Updated 3 years ago
- ☆35 Updated last year
- ☆96 Updated 2 years ago
- Image augmentation library for JAX ☆36 Updated 5 months ago
- Structured matrices for compressing neural networks ☆65 Updated 11 months ago
- Monotone operator equilibrium networks ☆51 Updated 4 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆103 Updated 2 years ago
- ☆32 Updated 11 months ago
- paper lists and information on mean-field theory of deep learning ☆75 Updated 5 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization ☆55 Updated 3 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited ☆35 Updated last year
- Autoregressive Energy Machines ☆77 Updated last year
- 👩 PyTorch and JAX code for the Madam optimiser. ☆50 Updated 3 years ago
- ☆52 Updated last month
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow. ☆21 Updated 4 years ago
- ☆78 Updated 3 years ago
- Collection of snippets for PyTorch users ☆26 Updated 2 years ago