joelburget / mamba-saeLinks
Training and evaluating Sparse Autoencoders for Mamba
☆9Updated 6 months ago
Alternatives and similar repositories for mamba-sae
Users that are interested in mamba-sae are comparing it to the libraries listed below
Sorting:
- ☆12Updated 7 months ago
- Quantification of Uncertainty with Adversarial Models☆29Updated last year
- ☆13Updated this week
- ☆31Updated last year
- 🧮 Algebraic Positional Encodings.☆13Updated 4 months ago
- Understanding how features learned by neural networks evolve throughout training☆34Updated 7 months ago
- Code associated to papers on superposition (in ML interpretability)☆28Updated 2 years ago
- ☆17Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 10 months ago
- ☆16Updated last year
- ☆22Updated 7 months ago
- Example of Dense Associative Memory training on MNIST☆36Updated 2 years ago
- Rust bindings to GAP (Groups, Algorithms, Programming)☆23Updated last year
- ☆29Updated 2 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- The Energy Transformer block, in JAX☆57Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆27Updated 4 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Code for "The Expressive Power of Low-Rank Adaptation".☆20Updated last year
- ☆53Updated 8 months ago
- Explorations with Geoffrey Hinton's Forward Forward algoithm☆33Updated last year
- Fast singularity detection with kernel☆33Updated last year
- gzip Predicts Data-dependent Scaling Laws☆35Updated last year
- Griffin MQA + Hawk Linear RNN Hybrid☆87Updated last year
- Latent Large Language Models☆18Updated 9 months ago
- direct preference optimization with only 1 model copy :)☆14Updated last year
- RWKV model implementation☆38Updated last year
- Training GPTs to solve interaction nets☆17Updated 9 months ago
- Experiments on the impact of depth in transformers and SSMs.☆30Updated 7 months ago
- ☆68Updated 9 months ago