PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆39Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆90Jul 4, 2022Updated 3 years ago
- ☆12Oct 8, 2020Updated 5 years ago
- Code repository for the paper "Towards a Comprehensive Evaluation of Dimension Reduction Methods for Data Visualization"☆15Jul 4, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated 2 years ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 11 months ago
- ☆28Jul 12, 2018Updated 7 years ago
- Code and data for HEF, published in The Web Conference 2021.☆17Mar 31, 2021Updated 5 years ago
- Bit-by-bit: Search Strategies, Resource Organization, Management & Sustainability; The Creation of Knowledge in the Digital Era☆14Nov 22, 2024Updated last year
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- PLLay: Efficient Topological Layer based on Persistence Landscapes☆23Dec 10, 2020Updated 5 years ago
- "Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019☆22Aug 22, 2019Updated 6 years ago
- ☆16Sep 2, 2020Updated 5 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆16Oct 14, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- Routes for speed.☆16Jan 2, 2025Updated last year
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆14Feb 18, 2020Updated 6 years ago
- An ASE-friendly implementation of the amorphous-to-crystalline (a2c) workflow.☆18Oct 19, 2025Updated 8 months ago
- ☆18Dec 3, 2020Updated 5 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆23Jan 21, 2021Updated 5 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆22Mar 12, 2026Updated 3 months ago
- ☆17Dec 21, 2023Updated 2 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews☆12Sep 22, 2023Updated 2 years ago
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- Python re-implementation of szabo.f☆10Jul 30, 2015Updated 10 years ago
- Statistics on most cited papers in recent years of each conferences☆13Oct 24, 2018Updated 7 years ago
- VeLO optimizer in PyTorch☆20Feb 6, 2023Updated 3 years ago
- Deep learning toolkit for multi-input multi-output sequence modelling with tensorflow☆18Jan 18, 2018Updated 8 years ago
- ☆19Apr 16, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Julia enum made nicer☆10May 28, 2020Updated 6 years ago
- Code for creating recurrent neural network with rotational dynamics. Model is discussed in detail in "Rotational Dynamics Reduce Interfer…☆17Jul 23, 2020Updated 5 years ago
- ☆15Jul 6, 2015Updated 10 years ago
- ☆15Apr 26, 2025Updated last year
- Official Implementation of Infinite-Resolution Integral Noise Warping for Diffusion Models [ICLR 2025]☆16Mar 15, 2025Updated last year
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 4 years ago
- Agents for intelligence and coordination☆24Updated this week