Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.
☆43May 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for grokking
Users who are interested in grokking are comparing it to the libraries listed below.
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆90Jul 4, 2022Updated 3 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆65Feb 24, 2023Updated 3 years ago
- Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".☆17Jun 10, 2022Updated 3 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- Learning Debiased and Disentangled Representations for Semantic Segmentation (NeurIPS 2021)☆13Jan 23, 2022Updated 4 years ago
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆26Oct 8, 2023Updated 2 years ago
- Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)☆21Sep 27, 2022Updated 3 years ago
- chrome extension☆18Jun 29, 2020Updated 5 years ago
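- The first open-source project of the Yuanxin community: implementing TRIZ theory in software. We hope this open-source project helps more people and organizations solve problems creatively☆16Apr 18, 2016Updated 10 years ago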
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆53Aug 10, 2025Updated 9 months ago
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆23Jan 28, 2025Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆238Jul 19, 2025Updated 10 months ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- A wrapper library that provides access to Quik functionality from Python☆12Feb 16, 2024Updated 2 years ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 3 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆580Jun 28, 2024Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam☆87Jul 28, 2024Updated last year
- Yet another unofficial Xray server container with built in Nginx and acme.sh cert support on x86 and arm/arm64☆34Apr 10, 2026Updated last month
- ☆25May 20, 2025Updated 11 months ago
- ☆19Apr 10, 2024Updated 2 years ago
- A lammps fix module to perform path integral molecular dynamics (PIMD) tasks.☆11May 28, 2022Updated 3 years ago
- The Nutmeg machine learning models☆11Jan 23, 2025Updated last year
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- Implementation of a Conditional VAE trained on MNIST with TensorFlow 1.3.0.☆10Nov 7, 2017Updated 8 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆15Jul 25, 2023Updated 2 years ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆13Aug 11, 2025Updated 9 months ago
- ☆22Dec 1, 2025Updated 5 months ago
- MCP Server Implementation on Kakao Developers API to connect an AI Agent☆17Jun 26, 2025Updated 10 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated 2 years ago
- ☆28May 8, 2024Updated 2 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated 11 months ago
- ☆12Jul 30, 2025Updated 9 months ago
- Repo for the paper "Exploiting redundancy in large materials datasets for efficient machine learning with less data"☆11Sep 23, 2024Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Nov 22, 2023Updated 2 years ago
- Implementation of IODINE model☆10Jun 7, 2019Updated 6 years ago
- ☆18Jul 29, 2024Updated last year
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Oct 7, 2022Updated 3 years ago