PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆39Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆88Jul 4, 2022Updated 3 years ago
- ☆83Oct 11, 2022Updated 3 years ago
- Repository for content for the AMLD2020 workshop "Spiking neural networks for real-time inference tasks"☆17Oct 28, 2020Updated 5 years ago
- ☆12Oct 8, 2020Updated 5 years ago
- ☆10Jan 25, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- CVPR 2019 paper "Disentangling Adversarial Robustness and Generalization".☆14Oct 28, 2019Updated 6 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated last year
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 8 months ago
- ☆14Jun 22, 2025Updated 9 months ago
- Anubis (formerly known as Benchmark AI), measures the goodness of machine learning workloads☆19Nov 8, 2022Updated 3 years ago
- A graphql client to get your subscriptions through tough firewalls and unreliable mobile networks☆14Sep 4, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Oct 7, 2022Updated 3 years ago
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- ☆16Oct 20, 2025Updated 5 months ago
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- ☆21Sep 6, 2021Updated 4 years ago
- Preprint classes for biological journals☆31Feb 19, 2026Updated last month
- Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr…☆15Jul 25, 2024Updated last year
- ☆21Jan 23, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Sep 2, 2020Updated 5 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- Orthogonal Matching Pursuit, parallelized on both CPU and GPU. 100x+ Speedup☆16Updated this week
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆14Feb 18, 2020Updated 6 years ago
- ☆14Jul 17, 2024Updated last year
- Algorithm to detect bursts in the EEG of preterm infants (Python version of an existing Matlab program)☆11Feb 17, 2020Updated 6 years ago
- ☆17Dec 3, 2020Updated 5 years ago
- Accompanying code for our NeurIPS 2019 paper☆11Nov 7, 2019Updated 6 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An Android client for the Weather Underground Weather API☆57Sep 29, 2014Updated 11 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- ☆17Dec 21, 2023Updated 2 years ago
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Mar 12, 2026Updated 2 weeks ago
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- Nomalizing flows for orbita-free DFT☆11Sep 20, 2024Updated last year
- ☆19Apr 16, 2022Updated 3 years ago