PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆39Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Omnigrok: Grokking Beyond Algorithmic Data☆64Feb 24, 2023Updated 3 years ago
- Feedback-based Online Local Learning Of Weights (FOLLOW)☆12Feb 13, 2018Updated 8 years ago
- ☆12Oct 8, 2020Updated 5 years ago
- Code repository for the paper "Towards a Comprehensive Evaluation of Dimension Reduction Methods for Data Visualization"☆13Jul 4, 2024Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated last year
- Anubis (formerly known as Benchmark AI), measures the goodness of machine learning workloads☆19Nov 8, 2022Updated 3 years ago
- A graphql client to get your subscriptions through tough firewalls and unreliable mobile networks☆14Sep 4, 2025Updated 8 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 9 months ago
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- The continuing saga of displacement and normals in three.js☆18Jan 5, 2015Updated 11 years ago
- Yet another react component to render markdown.☆16May 28, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Bullseye Polytope Clean-Label Poisoning Attack☆17Nov 5, 2020Updated 5 years ago
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- ☆21Sep 6, 2021Updated 4 years ago
- A nim module to handle polynomials☆13Jun 7, 2022Updated 3 years ago
- "Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019☆22Aug 22, 2019Updated 6 years ago
- ☆15Sep 2, 2020Updated 5 years ago
- Routes for speed.☆16Jan 2, 2025Updated last year
- ☆28Feb 17, 2024Updated 2 years ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning☆23Jan 19, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Dec 3, 2020Updated 5 years ago
- Accompanying code for our NeurIPS 2019 paper☆11Nov 7, 2019Updated 6 years ago
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆22Jan 21, 2021Updated 5 years ago
- Cleaned daily reports and time series data from the 2019 Novel Coronavirus COVID-19 (2019-nCoV) Data Repository by Johns Hopkins Universi…☆12Oct 27, 2023Updated 2 years ago
- An Android client for the Weather Underground Weather API☆57Sep 29, 2014Updated 11 years ago
- interactively identify related Authors on arxiv☆14Sep 22, 2023Updated 2 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- ☆17Dec 21, 2023Updated 2 years ago
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated 11 months ago
- Python re-implementation of szabo.f☆10Jul 30, 2015Updated 10 years ago
- github.js + gist + thebe === quick runnable nbviewer☆20Jan 6, 2016Updated 10 years ago
- ☆19Apr 16, 2022Updated 4 years ago
- A MATLAB toolbox for interacting with the Allen Brain Observatory☆22Mar 30, 2026Updated last month
- Code for creating recurrent neural network with rotational dynamics. Model is discussed in detail in "Rotational Dynamics Reduce Interfer…☆17Jul 23, 2020Updated 5 years ago
- A simple flask upload program for multiple files requiring credential☆17Nov 14, 2020Updated 5 years ago