teddykoker / grokkingLinks
PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆38Updated 4 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below
Sorting:
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆63Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆107Updated 5 years ago
- paper lists and information on mean-field theory of deep learning☆79Updated 6 years ago
- Hessian spectral density estimation in TF and Jax☆124Updated 5 years ago
- Structured matrices for compressing neural networks☆67Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆83Updated 3 years ago
- CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.☆78Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Updated 3 years ago
- Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes…☆249Updated 5 years ago
- ☆100Updated 4 years ago
- Code for: "Neural Rough Differential Equations for Long Time Series", (ICML 2021)☆122Updated 4 years ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆42Updated 5 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆48Updated 6 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆109Updated 2 years ago
- Convolutional Neural Tangent Kernel☆112Updated 6 years ago
- A centralized place for deep thinking code and experiments☆90Updated 2 years ago
- ☆172Updated last year
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆219Updated last week
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Updated 6 years ago
- Neural Turing Machines in pytorch☆49Updated 4 years ago
- ☆37Updated 4 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆148Updated 2 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- Repo to accompany paper "Implicit Self-Regularization in Deep Neural Networks..."☆47Updated 7 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆62Updated 2 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆74Updated 9 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- ☆33Updated 5 years ago
- Some small scale experiments for my blog posts 📝☆80Updated 3 years ago
- ☆68Updated 6 years ago