Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.
☆44May 2, 2026Updated last month
Alternatives and similar repositories for grokking
Users who are interested in grokking are comparing it to the libraries listed below.
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆90Jul 4, 2022Updated 3 years ago
- ☆19Feb 28, 2025Updated last year
- Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".☆17Jun 10, 2022Updated 4 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆11Nov 23, 2023Updated 2 years ago
- PKUAutoElective server container deployment☆12Feb 23, 2024Updated 2 years ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- ☆14Oct 18, 2021Updated 4 years ago
- ☆24Jul 25, 2024Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆53Aug 10, 2025Updated 10 months ago
- kmeans algorithm for GPU, with and without triangle inequality☆13Apr 6, 2010Updated 16 years ago
- A practical guide to deep learning, for unconventional people.☆12Jan 6, 2018Updated 8 years ago
- Backtesting trading strategies with Python libraries☆15May 26, 2025Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆240Jul 19, 2025Updated 11 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆12Jun 10, 2024Updated 2 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 3 years ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin…☆51May 4, 2024Updated 2 years ago
- Calculate expected profit & loss for options☆15Aug 5, 2019Updated 6 years ago
- Latest and fastest EigenPro that scales to billions of examples☆10Apr 18, 2026Updated 2 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆582Jun 28, 2024Updated 2 years ago
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆26Jun 13, 2026Updated 2 weeks ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, don't use it for Adam☆88Jul 28, 2024Updated last year
- A Mechanistic Interpretability Analysis of Grokking☆28Sep 26, 2022Updated 3 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆19Oct 22, 2023Updated 2 years ago
- ☆26Jun 9, 2026Updated 3 weeks ago
- A PyTorch implementation of SimSiam based on CVPR 2021 paper "Exploring Simple Siamese Representation Learning"☆12Mar 23, 2021Updated 5 years ago
- ☆19Apr 10, 2024Updated 2 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files.☆15Mar 13, 2024Updated 2 years ago
- The Nutmeg machine learning models☆11Jan 23, 2025Updated last year
- Some examples and tests with LicheeRV Nano☆37Aug 9, 2025Updated 10 months ago
- Implement Conditional VAE and train on MNIST by tensorflow 1.3.0.☆10Nov 7, 2017Updated 8 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆15Jul 25, 2023Updated 2 years ago
- A Supabase MCP server compatible with cursor☆20Feb 13, 2025Updated last year
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆13Aug 11, 2025Updated 10 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated 2 years ago
- Deribit bot to Delta Hedge Strategy.☆13Oct 2, 2022Updated 3 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated last year
- The deribit_historical_trades repository gathers cryptocurrency (BTC, ETH, SOL, USDC) derivatives traded on the cryptocurrency derivative…☆24Mar 1, 2023Updated 3 years ago
- semantically labels kinect pointclouds☆22Aug 30, 2013Updated 12 years ago