goodfire-ai / spd

Stochastic Parameter Decomposition

☆27

Alternatives and similar repositories for spd

Users who are interested in spd are comparing it to the libraries listed below.

taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
edwardmilsom / function-space-learning-rates-paper
Code for the paper "Function-Space Learning Rates"
☆20Updated last month
wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆30Updated last year
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 5 months ago
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
amudide / switch_sae
Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)
☆25Updated 7 months ago
tech-srl / layer_norm_expressivity_role
Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)
☆56Updated 9 months ago
clarifying-EM / model-organisms-for-EM
Code repo for the model organisms and convergent directions of EM papers.
☆17Updated last week
shikaiqiu / compute-better-spent
☆53Updated 9 months ago
SHI-Labs / CompactNet
☆31Updated last year
EleutherAI / features-across-time
Understanding how features learned by neural networks evolve throughout training
☆36Updated 8 months ago
brantondemoss / GrokkingComplexity
Code for
☆27Updated 7 months ago
AndPotap / einsum-search
☆32Updated 9 months ago
joey00072 / microjax
A JAX-like function transformation engine, but micro: microjax
☆33Updated 8 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆46Updated 3 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆75Updated 8 months ago
ApolloResearch / apd
Attribution-based Parameter Decomposition
☆26Updated last month
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127Updated 2 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
srush / mamba-primer
☆37Updated last year
Bond1995 / Markov
Code for experiments on transformers using Markovian data.
☆17Updated 7 months ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆16Updated 5 months ago
EleutherAI / elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…
☆28Updated last year
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77Updated 7 months ago
adamkarvonen / SAE_BoardGameEval
☆23Updated 5 months ago
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆57Updated 8 months ago
bilal-chughtai / rep-theory-mech-interp
☆26Updated 2 years ago
keyonvafa / world-model-evaluation
☆59Updated 8 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 7 months ago
JoshEngels / SAE-Dark-Matter
Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"
☆22Updated 5 months ago