danielmamay/grokking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/danielmamay/grokking)

danielmamay / grokking

Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.

☆44

Alternatives and similar repositories for grokking

Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sea-Snell / grokking
View on GitHub
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆90Jul 4, 2022Updated 4 years ago
KindXiaoming / Omnigrok
View on GitHub
Omnigrok: Grokking Beyond Algorithmic Data
☆65Feb 24, 2023Updated 3 years ago
d-doshi / Grokking
View on GitHub
☆19Feb 28, 2025Updated last year
neuroai / Go-CLS_v2
View on GitHub
☆16May 16, 2023Updated 3 years ago
bjmin / bmin_comphys
View on GitHub
computational physics (Chungbuk National University, Korea)
☆10May 26, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
andylolu2 / jax-vqvae-gpt
View on GitHub
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.
☆11Nov 23, 2023Updated 2 years ago
aradha / deep_neural_feature_ansatz
View on GitHub
Code for verifying deep neural feature ansatz
☆22May 3, 2023Updated 3 years ago
mariogeiger / feature_lazy
View on GitHub
☆14Oct 18, 2021Updated 4 years ago
shunzh / mcts-for-llm
View on GitHub
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆16Jun 28, 2024Updated 2 years ago
jh-jeong / smoothmix
View on GitHub
Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)
☆21Sep 27, 2022Updated 3 years ago
teddykoker / grokking
View on GitHub
PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆39Dec 7, 2021Updated 4 years ago
trizchina / triz
View on GitHub
源心社区的第一个开源项目：通过软件实现TRIZ理论。我们希望通过这个开源项目帮助更多人和组织创造性地解决问题
☆17Apr 18, 2016Updated 10 years ago
kaistAI / Janus
View on GitHub
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆53Aug 10, 2025Updated 11 months ago
OSU-NLP-Group / GrokkedTransformer
View on GitHub
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆240Jul 19, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
nikcheerla / deeplearningschool
View on GitHub
A practical guide to deep learning, for unconventional people.
☆13Jan 6, 2018Updated 8 years ago
Laz4rz / mup
View on GitHub
Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation
☆14Jan 2, 2026Updated 6 months ago
ztjhz / t5-jax
View on GitHub
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆24Jun 10, 2023Updated 3 years ago
ironjr / grokfast
View on GitHub
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
☆583Jun 28, 2024Updated 2 years ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
odhyan / options-calculator
View on GitHub
Calculate expected profit & loss for options
☆15Aug 5, 2019Updated 6 years ago
cleanlab / structured-output-benchmark
View on GitHub
A Structured Output Benchmark whose 'ground-truth' is actually right
☆22Dec 5, 2025Updated 7 months ago
pnnl / torchntk
View on GitHub
☆29Mar 18, 2023Updated 3 years ago
DT6A / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19Oct 22, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
hanantabak2 / AI_Research_Assistant_CrewAI_RAG
View on GitHub
Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources
☆23Jul 15, 2024Updated 2 years ago
asahi417 / ConditionalVariationalAutoEncoder
View on GitHub
Implement Conditional VAE and train on MNIST by tensorflow 1.3.0.
☆10Nov 7, 2017Updated 8 years ago
shreyansh26 / Red-Teaming-Language-Models-with-Language-Models
View on GitHub
A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022
☆35Oct 9, 2023Updated 2 years ago
PeterWang512 / AttributeByUnlearning
View on GitHub
Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."
☆17May 23, 2025Updated last year
jcottaar / seismic
View on GitHub
Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)
☆13Aug 11, 2025Updated 11 months ago
budzianowski / opengvl
View on GitHub
Open GVL
☆23Dec 1, 2025Updated 7 months ago
SlippyDong / supabase-mcp-cursor
View on GitHub
A Supabase MCP server compatible with cursor
☆20Feb 13, 2025Updated last year
TiesdeKok / acctg-579B-python-for-business-research
View on GitHub
This is a standalone version of my former ACCTG 579B phd class on Python programming for business research.
☆20Aug 22, 2023Updated 2 years ago
Zinxira / tlvmc-parkinsons-fog-prediction-4th-place-solution
View on GitHub
☆11Aug 3, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
kyrie-23 / linear_task_arithmetic
View on GitHub
☆12Jul 30, 2025Updated 11 months ago
technion-cs-nlp / llm-arithmetic-heuristics
View on GitHub
☆26Jun 9, 2026Updated last month
microsoft / TraceCodegen
View on GitHub
☆27Jun 12, 2023Updated 3 years ago
nimamahmoudi / LLMStreamlitDemoBasic
View on GitHub
☆18Dec 26, 2023Updated 2 years ago
wildphoton / Compositional-Generalization
View on GitHub
[NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…
☆13Oct 7, 2022Updated 3 years ago
maragraziani / concept_discovery_svd
View on GitHub
Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…
☆14Nov 22, 2023Updated 2 years ago
MARIO-Math-Reasoning / MARIO
View on GitHub
☆28May 8, 2024Updated 2 years ago