Omnigrok: Grokking Beyond Algorithmic Data
☆65 · Feb 24, 2023 · Updated 3 years ago
Alternatives and similar repositories for Omnigrok
Users that are interested in Omnigrok are comparing it to the libraries listed below.
- PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆39 · Dec 7, 2021 · Updated 4 years ago
- ☆19 · Feb 28, 2025 · Updated last year
- ☆88 · Oct 11, 2022 · Updated 3 years ago
- We study toy models of skill learning. ☆33 · Feb 3, 2026 · Updated 4 months ago
- ☆10 · Dec 17, 2019 · Updated 6 years ago
- Code for verifying deep neural feature ansatz ☆22 · May 3, 2023 · Updated 3 years ago
- ☆28 · Feb 1, 2023 · Updated 3 years ago
- ☆12 · Jan 9, 2024 · Updated 2 years ago
- ☆14 · Oct 18, 2021 · Updated 4 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆90 · Jul 4, 2022 · Updated 3 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆68 · Jan 26, 2026 · Updated 4 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023) ☆15 · Jul 25, 2023 · Updated 2 years ago
- The official repository for AdaMuon ☆39 · Aug 27, 2025 · Updated 9 months ago
- ☆29 · Mar 18, 2023 · Updated 3 years ago
- ☆20 · Feb 19, 2024 · Updated 2 years ago
- ☆21 · Mar 1, 2023 · Updated 3 years ago
- ☆17 · Jun 20, 2024 · Updated last year
- ☆16 · May 16, 2023 · Updated 3 years ago
- ☆39 · Feb 18, 2025 · Updated last year
- Huggingface implementation of MVDream for easy import ☆16 · Mar 31, 2025 · Updated last year
- ☆46 · Dec 12, 2023 · Updated 2 years ago
- ☆67 · Apr 12, 2025 · Updated last year
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks ☆13 · Aug 9, 2022 · Updated 3 years ago
- Code for the paper "Function-Space Learning Rates" ☆24 · Jun 3, 2025 · Updated last year
- NeurIPS23 "Flow Factorized Representation Learning" ☆45 · Dec 15, 2025 · Updated 6 months ago
- A Wikipedia-based summarization dataset ☆14 · Mar 27, 2023 · Updated 3 years ago
- ☆10 · May 8, 2024 · Updated 2 years ago
- ☆75 · Jul 15, 2024 · Updated last year
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2) ☆42 · Jan 21, 2024 · Updated 2 years ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha… ☆19 · Jan 11, 2025 · Updated last year
- PyTorch implementation of StableMask (ICML'24) ☆15 · Jun 27, 2024 · Updated last year
- (CVPR 2024) Accelerating Neural Field Training via Soft Mining ☆41 · Dec 2, 2025 · Updated 6 months ago
- ☆34 · Nov 30, 2025 · Updated 6 months ago
- ☆33 · Jan 7, 2025 · Updated last year
- ☆22 · Updated this week
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here. ☆13 · Jul 23, 2025 · Updated 10 months ago
- ☆36 · Feb 26, 2023 · Updated 3 years ago
- A python implementation of the Ensemble Biclustering for Classification (EBC) algorithm. EBC is a co-clustering algorithm that allows you… ☆20 · Apr 7, 2017 · Updated 9 years ago
- https://openreview.net/pdf?id=V5XDYSRcQP ☆14 · May 22, 2025 · Updated last year