joelburget / mamba-sae
Training and evaluating Sparse Autoencoders for Mamba
☆9Updated last month
Related projects: ⓘ
- Sparse and discrete interpretability tool for neural networks☆51Updated 7 months ago
- Understanding how features learned by neural networks evolve throughout training☆30Updated this week
- Measuring generalization properties of graph neural networks☆13Updated last year
- ☆54Updated last week
- Evaluation of neuro-symbolic engines☆29Updated last month
- The Energy Transformer block, in JAX☆48Updated 9 months ago
- Quantification of Uncertainty with Adversarial Models☆27Updated last year
- ☆42Updated 3 months ago
- ☆16Updated 8 months ago
- gzip Predicts Data-dependent Scaling Laws☆31Updated 3 months ago
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆15Updated last month
- Implementation of Spectral State Space Models☆16Updated 6 months ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 2 years ago
- ☆21Updated last year
- ☆46Updated 7 months ago
- ☆66Updated last month
- A MAD laboratory to improve AI architecture designs 🧪☆84Updated 4 months ago
- ☆33Updated 3 months ago
- ☆23Updated 2 years ago
- A package for defining deep learning models using categorical algebraic expressions.☆53Updated last month
- ☆16Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!☆45Updated last month
- ☆23Updated 6 months ago
- ☆14Updated 3 weeks ago
- Understand and test language model architectures on synthetic tasks.☆156Updated 4 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated 10 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆74Updated 7 months ago
- Explorations with Geoffrey Hinton's Forward Forward algoithm☆32Updated 8 months ago
- Universal Neurons in GPT2 Language Models☆25Updated 3 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- ☆26Updated last week