Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.
☆42Sep 23, 2023Updated 2 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆86Jul 4, 2022Updated 3 years ago
- Implementation of the work Variational multiple shooting for Bayesian ODEs with Gaussian processes☆13Aug 5, 2022Updated 3 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- ☆16May 16, 2023Updated 2 years ago
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Oct 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated last year
- ☆14Apr 9, 2025Updated 11 months ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- ☆24Jul 25, 2024Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆53Aug 10, 2025Updated 7 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆12Jun 10, 2024Updated last year
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- Библиотека-обертка, которая позволяет получить доступ к функционалу Quik из Python☆11Feb 16, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 2 years ago
- Calculate expected profit & loss for options☆15Aug 5, 2019Updated 6 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- ☆25May 20, 2025Updated 10 months ago
- Chatbot for the good vibes.☆11Aug 29, 2024Updated last year
- ☆19Apr 10, 2024Updated last year
- A lammps fix module to perform path integral molecular dynamics (PIMD) tasks.☆11May 28, 2022Updated 3 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files.☆15Mar 13, 2024Updated 2 years ago
- The Nutmeg machine learning models☆11Jan 23, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implement Conditional VAE and train on MNIST by tensorflow 1.3.0.☆10Nov 7, 2017Updated 8 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆14Jul 25, 2023Updated 2 years ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆11Aug 11, 2025Updated 7 months ago
- Deribit bot to Delta Hedge Strategy.☆13Oct 2, 2022Updated 3 years ago
- MCP Server Implementation on Kakao Developers API to connect an AI Agent☆14Jun 26, 2025Updated 9 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆13May 14, 2024Updated last year
- ☆18Nov 23, 2023Updated 2 years ago
- A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022☆34Oct 9, 2023Updated 2 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Jul 30, 2025Updated 7 months ago
- The deribit_historical_trades repository gathers cryptocurrency (BTC, ETH, SOL, USDC) derivatives traded on the cryptocurrency derivative…☆24Mar 1, 2023Updated 3 years ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Nov 22, 2023Updated 2 years ago
- ☆18Jul 29, 2024Updated last year
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- [EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression☆33Dec 7, 2022Updated 3 years ago
- Helps create datasets scraped from Google Images☆12Oct 31, 2018Updated 7 years ago