wattenberg / superposition
Code associated with papers on superposition (in ML interpretability)
☆33 Updated 3 years ago
Alternatives and similar repositories for superposition
Users who are interested in superposition compare it to the repositories listed below.
 - The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆170 Updated 4 months ago
 - Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆129 Updated 3 years ago
 - ☆166 Updated 2 years ago
 - ☆27 Updated 2 years ago
 - ☆91 Updated last year
 - nanoGPT-like codebase for LLM training ☆110 Updated this week
 - unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆79 Updated 3 years ago
 - Understand and test language model architectures on synthetic tasks. ☆234 Updated last month
 - A MAD laboratory to improve AI architecture designs 🧪 ☆132 Updated 10 months ago
 - ☆60 Updated last year
 - Sparse Autoencoder Training Library ☆55 Updated 6 months ago
 - ☆32 Updated 7 months ago
 - Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆88 Updated last year
 - ☆130 Updated 2 years ago
 - ☆70 Updated 3 years ago
 - Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆84 Updated last year
 - Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆59 Updated 2 years ago
 - Omnigrok: Grokking Beyond Algorithmic Data ☆62 Updated 2 years ago
 - seqax = sequence modeling + JAX ☆168 Updated 3 months ago
 - ☆53 Updated last year
 - Attribution-based Parameter Decomposition ☆31 Updated 4 months ago
 - Open source replication of Anthropic's Crosscoders for Model Diffing ☆59 Updated last year
 - JAX implementation of the Mistral 7b v0.2 model ☆34 Updated last year
 - Understanding how features learned by neural networks evolve throughout training ☆39 Updated last year
 - ☆33 Updated last year
 - Resources from the EleutherAI Math Reading Group ☆54 Updated 8 months ago
 - A set of Python scripts that makes your experience on TPU better ☆54 Updated last month
 - ☆37 Updated 8 months ago
 - [NeurIPS 2023] Learning Transformer Programs ☆162 Updated last year
 - supporting pytorch FSDP for optimizers ☆83 Updated 10 months ago