scienceetonnante / grokking
Demonstration of the grokking phenomenon in machine learning in a simple case
☆61 Updated 8 months ago
Alternatives and similar repositories for grokking
Users who are interested in grokking are comparing it to the repositories listed below
- ☆17 Updated last year
- A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference. ☆260 Updated last month
- Uncertainty quantification with PyTorch ☆374 Updated last week
- The boundary of neural network trainability is fractal ☆217 Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable. ☆173 Updated 2 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆26 Updated last year
- 🧠 Starter templates for doing interpretability research ☆75 Updated 2 years ago
- Tools for studying developmental interpretability in neural networks. ☆109 Updated 3 months ago
- Parameter-Free Optimizers for PyTorch ☆131 Updated last year
- Mechanistic Interpretability Visualizations using React ☆293 Updated 10 months ago
- ☆11 Updated last year
- My writings about ARC (Abstraction and Reasoning Corpus) ☆85 Updated last week
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆228 Updated 2 months ago
- Erasing concepts from neural representations with provable guarantees ☆238 Updated 8 months ago
- ☆354 Updated 2 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs). ☆311 Updated 2 months ago
- ☆22 Updated 6 months ago
- A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper) ☆21 Updated 4 months ago
- ☆23 Updated last week
- git extension for {collaborative, communal, continual} model development ☆215 Updated 11 months ago
- Sparse Autoencoder for Mechanistic Interpretability ☆272 Updated last year
- ☆546 Updated last year
- Course materials of "Bayesian Modelling and Probabilistic Programming with Numpyro, and Deep Generative Surrogates for Epidemiology" ☆70 Updated 7 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆191 Updated 2 years ago
- epsilon machines and transformers! ☆31 Updated 3 months ago
- A framework for conducting machine learning experiments in Python ☆42 Updated 10 months ago
- Hierarchical Associative Memory User Experience ☆104 Updated 3 months ago
- ☆84 Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆79 Updated 3 years ago
- Sparsify transformers with SAEs and transcoders ☆640 Updated last week