unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆86Jul 4, 2022Updated 3 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Omnigrok: Grokking Beyond Algorithmic Data☆63Feb 24, 2023Updated 3 years ago
- ☆12Jul 30, 2025Updated 7 months ago
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆18Jan 28, 2025Updated last year
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- In-silico design pipeline for evaluating protein structure diffusion models.☆30Jun 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆27Oct 23, 2025Updated 5 months ago
- Code for the AISTATS 2024 Paper "From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictiv…☆24Feb 14, 2024Updated 2 years ago
- ☆14Apr 9, 2025Updated 11 months ago
- command line fractal rendering☆13Mar 25, 2022Updated 4 years ago
- Tools for studying developmental interpretability in neural networks.☆128Dec 30, 2025Updated 2 months ago
- ☆14Mar 4, 2024Updated 2 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆35Oct 28, 2025Updated 4 months ago
- Predict scalar coupling in molecules☆15Mar 14, 2021Updated 5 years ago
- Pruning is all you need (hopefully)☆12Sep 7, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆20Oct 21, 2021Updated 4 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆20May 1, 2023Updated 2 years ago
- On the roots of beauty☆13Nov 27, 2022Updated 3 years ago
- A Windows program to search for cellular automata patterns☆16Jan 24, 2013Updated 13 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- ☆15Jun 10, 2022Updated 3 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆72May 18, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An agent for playing Atari games running on a Teensy microcontroller☆15Nov 11, 2022Updated 3 years ago
- ☆15Apr 1, 2020Updated 5 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Ini kumpulan beberapa materi lab pada Digitalent Schoolarship Python Essentials 2019☆10Mar 27, 2022Updated 4 years ago
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- code associated with paper "Sparse Bayesian Optimization"☆26Oct 31, 2023Updated 2 years ago
- Proof recording for Lean 3☆27Sep 30, 2021Updated 4 years ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Computationally friendly hyper-parameter search with DP-SGD☆25Jan 7, 2025Updated last year
- A Rougelike Peer-to-Peer Multi Player Dungeon Explorer Game written in Rust☆10Feb 12, 2022Updated 4 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- Protein scoring and sampling of 'Combinatorial Variant Effects from Structure' (CoVES)☆11Jan 5, 2024Updated 2 years ago
- Learning to Combine Local and Global Image Information for Contactless Palmprint Recognition☆11Dec 7, 2021Updated 4 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated last year