Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.
☆43 May 2, 2026 Updated last month
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below.
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆90 Jul 4, 2022 Updated 3 years ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆65 Feb 24, 2023 Updated 3 years ago
- Implementation of the work Variational multiple shooting for Bayesian ODEs with Gaussian processes ☆14 Aug 5, 2022 Updated 3 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance. ☆10 Sep 21, 2022 Updated 3 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem. ☆11 Nov 23, 2023 Updated 2 years ago
- Learning Debiased and Disentangled Representations for Semantic Segmentation (NeurIPS 2021) ☆13 Jan 23, 2022 Updated 4 years ago
- Code for verifying deep neural feature ansatz ☆22 May 3, 2023 Updated 3 years ago
- ☆14 Oct 18, 2021 Updated 4 years ago
- chrome extension ☆18 Jun 29, 2020 Updated 5 years ago
- ☆24 Jul 25, 2024 Updated last year
- PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆39 Dec 7, 2021 Updated 4 years ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆53 Aug 10, 2025 Updated 10 months ago
- kmeans algorithm for GPU, with and without triangle inequality ☆13 Apr 6, 2010 Updated 16 years ago
- A practical guide to deep learning, for unconventional people. ☆12 Jan 6, 2018 Updated 8 years ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆238 Jul 19, 2025 Updated 10 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch ☆12 Jun 10, 2024 Updated 2 years ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion ☆11 Apr 14, 2023 Updated 3 years ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 May 4, 2024 Updated 2 years ago
- Latest and fastest EigenPro that scales to billions of examples ☆10 Apr 18, 2026 Updated last month
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆582 Jun 28, 2024 Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam. ☆87 Jul 28, 2024 Updated last year
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆19 Oct 22, 2023 Updated 2 years ago
- ☆25 May 20, 2025 Updated last year
- ☆19 Apr 10, 2024 Updated 2 years ago
- ☆29 Mar 18, 2023 Updated 3 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files. ☆15 Mar 13, 2024 Updated 2 years ago
- The Nutmeg machine learning models ☆11 Jan 23, 2025 Updated last year
- ☆27 Jun 12, 2023 Updated 2 years ago
- Implement Conditional VAE and train on MNIST by tensorflow 1.3.0. ☆10 Nov 7, 2017 Updated 8 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023) ☆15 Jul 25, 2023 Updated 2 years ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025). ☆72 Sep 25, 2025 Updated 8 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy… ☆14 May 14, 2024 Updated 2 years ago
- ☆28 May 8, 2024 Updated 2 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images." ☆17 May 23, 2025 Updated last year
- ☆12 Jul 30, 2025 Updated 10 months ago
- ☆11 Aug 3, 2023 Updated 2 years ago
- semantically labels kinect pointclouds ☆22 Aug 30, 2013 Updated 12 years ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic… ☆14 Nov 22, 2023 Updated 2 years ago
- Implementation of IODINE model ☆10 Jun 7, 2019 Updated 7 years ago