unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆85Jul 4, 2022Updated 3 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- Russian coreference resolution made as simple and accessible as could be☆12Sep 3, 2022Updated 3 years ago
- ☆12Jul 30, 2025Updated 7 months ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Official Implementation of PatentLMM (our AAAI 2025 Paper)☆16Jan 28, 2025Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data☆63Feb 24, 2023Updated 3 years ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- ☆19Mar 25, 2025Updated 11 months ago
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Jul 17, 2023Updated 2 years ago
- ☆14Jul 22, 2021Updated 4 years ago
- Computationally friendly hyper-parameter search with DP-SGD☆25Jan 7, 2025Updated last year
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆21Sep 18, 2025Updated 5 months ago
- The GitBook documentation site for OpenAlex☆25Jan 18, 2026Updated last month
- ☆21Aug 18, 2022Updated 3 years ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆48Sep 3, 2023Updated 2 years ago
- Code for the AISTATS 2024 Paper "From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictiv…☆24Feb 14, 2024Updated 2 years ago
- code associated with paper "Sparse Bayesian Optimization"☆26Oct 31, 2023Updated 2 years ago
- ☆29Nov 19, 2025Updated 3 months ago
- ☆37Feb 16, 2025Updated last year
- ☆27Feb 1, 2023Updated 3 years ago
- [Model Context Protocol] Dev Kit - anything a developer need for him day to day works☆31Apr 1, 2025Updated 11 months ago
- Code for T-MARS data filtering☆35Aug 23, 2023Updated 2 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆24Sep 13, 2024Updated last year
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆34Oct 28, 2025Updated 4 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆43Aug 7, 2025Updated 7 months ago
- A system for automating selection and optimization of pre-trained models from the TAO Model Zoo☆29Jun 28, 2024Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- LLM sampling method for enforcing syntax adherence in generated output☆25May 31, 2023Updated 2 years ago
- MLX implementation of xLSTM model by Beck et al. (2024)☆32Jun 5, 2024Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆72Mar 26, 2023Updated 2 years ago
- ☆33May 15, 2024Updated last year
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated 11 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36May 2, 2023Updated 2 years ago
- Simple Scalable Discrete Diffusion for text in PyTorch☆37Sep 27, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago