Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆83 Jul 4, 2022 Updated 3 years ago
Alternatives and similar repositories for grokking
Users interested in grokking are comparing it to the repositories listed below
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆42 Sep 23, 2023 Updated 2 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms" ☆23 Oct 23, 2025 Updated 3 months ago
- Russian coreference resolution made as simple and accessible as could be ☆12 Sep 3, 2022 Updated 3 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, etc.) on a GPU. It makes use of auto-di… ☆10 Aug 28, 2018 Updated 7 years ago
- ☆12 Jul 30, 2025 Updated 6 months ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", … ☆17 Nov 15, 2020 Updated 5 years ago
- ☆19 Mar 25, 2025 Updated 10 months ago
- A browser extension providing Open Access bibliographical services ☆18 Dec 9, 2022 Updated 3 years ago
- ☆19 Feb 25, 2024 Updated last year
- Part of official implementation of "Natural language-informed learning of molecule graphs" ☆18 Jul 17, 2023 Updated 2 years ago
- Graph Transformers for Large Graphs ☆22 Apr 26, 2024 Updated last year
- Code for the paper "Function-Space Learning Rates" ☆25 Jun 3, 2025 Updated 8 months ago
- Exploring Model Kinship for Merging Large Language Models ☆27 Apr 16, 2025 Updated 9 months ago
- ☆21 Aug 18, 2022 Updated 3 years ago
- Official Implementation for NorMuon paper ☆55 Updated this week
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning ☆48 Sep 3, 2023 Updated 2 years ago
- code associated with paper "Sparse Bayesian Optimization" ☆26 Oct 31, 2023 Updated 2 years ago
- The GitBook documentation site for OpenAlex ☆26 Jan 18, 2026 Updated 3 weeks ago
- Tools for studying developmental interpretability in neural networks. ☆126 Dec 30, 2025 Updated last month
- ☆27 Feb 1, 2023 Updated 3 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆25 Sep 13, 2024 Updated last year
- Universal Neurons in GPT2 Language Models ☆30 May 28, 2024 Updated last year
- A diffusion model for structure-based drug design with faster inference from learned representations of protein structure. ☆31 Dec 18, 2023 Updated 2 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆34 Oct 28, 2025 Updated 3 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆42 Aug 7, 2025 Updated 6 months ago
- ☆30 Jan 17, 2022 Updated 4 years ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 Oct 18, 2023 Updated 2 years ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆72 Mar 26, 2023 Updated 2 years ago
- ☆33 May 15, 2024 Updated last year
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 Mar 14, 2025 Updated 11 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices ☆12 Jan 12, 2021 Updated 5 years ago
- ☆26 Updated this week
- Simple Scalable Discrete Diffusion for text in PyTorch ☆37 Sep 27, 2024 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Apr 17, 2024 Updated last year
- Utilities for the HuggingFace transformers library ☆74 Jan 21, 2023 Updated 3 years ago
- Repository of IPBench ☆19 Jan 4, 2026 Updated last month
- A collection of lab materials from the Digitalent Scholarship Python Essentials 2019 course ☆10 Mar 27, 2022 Updated 3 years ago
- Automated Continuous Data Quality Measurement ☆12 Nov 15, 2023 Updated 2 years ago
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content … ☆18 Apr 29, 2025 Updated 9 months ago