apple / ml-np-raspLinks

☆19

Alternatives and similar repositories for ml-np-rasp

Users that are interested in ml-np-rasp are comparing it to the libraries listed below

Sorting:

samacqua / LARC
Language-annotated Abstraction and Reasoning Corpus
☆88Updated 2 years ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆51Updated 3 years ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago
Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆77Updated 3 years ago
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127Updated 2 years ago
ssokota / mec
Code for minimum-entropy coupling.
☆32Updated last year
bilal-chughtai / rep-theory-mech-interp
☆26Updated 2 years ago
callummcdougall / sae_visualizer
☆28Updated last year
aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆85Updated last year
JacobPfau / procgenAISC
☆19Updated 2 years ago
apple / ml-planner
☆53Updated last year
keyonvafa / world-model-evaluation
☆56Updated 8 months ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆35Updated last year
AsaCooperStickland / situational-awareness-evals
Measuring the situational awareness of language models
☆36Updated last year
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆182Updated 2 years ago
AllanYangZhou / universal_neural_functional
☆51Updated last year
victorvikram / ConceptARC
Materials for ConceptARC paper
☆96Updated 8 months ago
louiskirsch / vsml-neurips2021
Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905
☆33Updated 3 years ago
redwoodresearch / interp
Redwood Research's transformer interpretability tools
☆14Updated 3 years ago
thestephencasper / everything-you-need
we got you bro
☆35Updated 11 months ago
DAIOS-AI / mindscript
A programming language for formal/informal computation.
☆41Updated 2 weeks ago
yashbonde / rasp
Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.
☆56Updated 3 years ago
timaeus-research / devinterp
Tools for studying developmental interpretability in neural networks.
☆99Updated 3 weeks ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49Updated last year
enajx / HyperNCA
☆39Updated 3 years ago
EleutherAI / elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…
☆28Updated last year
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆230Updated 5 months ago
mechanistic-interpretability-grokking / progress-measures-paper
☆68Updated 2 years ago