goodfire-ai / spdLinks
Stochastic Parameter Decomposition
☆27Updated this week
Alternatives and similar repositories for spd
Users that are interested in spd are comparing it to the libraries listed below
Sorting:
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Code for the paper "Function-Space Learning Rates"☆20Updated last month
- Universal Neurons in GPT2 Language Models☆30Updated last year
- Simple repository for training small reasoning models☆33Updated 5 months ago
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆25Updated 7 months ago
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆56Updated 9 months ago
- Code repo for the model organisms and convergent directions of EM papers.☆17Updated last week
- ☆53Updated 9 months ago
- ☆31Updated last year
- Understanding how features learned by neural networks evolve throughout training☆36Updated 8 months ago
- Code for☆27Updated 7 months ago
- ☆32Updated 9 months ago
- Jax like function transformation engine but micro, microjax☆33Updated 8 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆46Updated 3 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆75Updated 8 months ago
- Attribution-based Parameter Decomposition☆26Updated last month
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆127Updated 2 years ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- ☆37Updated last year
- Code for experiments on transformers using Markovian data.☆17Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆77Updated 7 months ago
- ☆23Updated 5 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆57Updated 8 months ago
- ☆26Updated 2 years ago
- ☆59Updated 8 months ago
- LLM training in simple, raw C/CUDA☆15Updated 7 months ago
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 5 months ago