teddykoker / grokkingLinks
PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆37Updated 3 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below
Sorting:
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62Updated 4 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆77Updated 3 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- ☆192Updated 3 weeks ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- A centralized place for deep thinking code and experiments☆85Updated last year
- Hessian spectral density estimation in TF and Jax☆123Updated 4 years ago
- paper lists and information on mean-field theory of deep learning☆78Updated 6 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Image augmentation library for Jax☆39Updated last year
- Some small scale experiments for my blog posts 📝☆79Updated 3 years ago
- ☆26Updated 2 years ago
- ☆37Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆179Updated last month
- Structured matrices for compressing neural networks☆67Updated last year
- ☆56Updated 3 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆112Updated 3 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime.☆46Updated 4 years ago
- Convolutional Neural Tangent Kernel☆111Updated 5 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- ☆100Updated 3 years ago
- Scaling scaling laws with board games.☆49Updated 2 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆125Updated last year
- Code for: "Neural Rough Differential Equations for Long Time Series", (ICML 2021)☆118Updated 4 years ago
- Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes…☆243Updated 4 years ago
- A library to create and manage configuration files, especially for machine learning projects.☆78Updated 3 years ago
- codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"☆49Updated 2 years ago
- CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.☆77Updated last year
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆107Updated last year