Xmaster6y / lczerolens
š¬ Interpretability for Leela Chess Zero networks.
ā11Updated last month
Alternatives and similar repositories for lczerolens:
Users that are interested in lczerolens are comparing it to the libraries listed below
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"ā20Updated 8 months ago
- Redwood Research's transformer interpretability toolsā14Updated 2 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from eā¦ā26Updated 9 months ago
- Mechanistic Interpretability for Transformer Modelsā49Updated 2 years ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.ā26Updated 2 weeks ago
- https://footprints.baulab.infoā16Updated 4 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)ā18Updated last month
- Sparse Autoencoder Training Libraryā42Updated 4 months ago
- we got you broā35Updated 7 months ago
- A library for efficient patching and automatic circuit discovery.ā54Updated 2 weeks ago
- ā29Updated 10 months ago
- š§ Starter templates for doing interpretability researchā67Updated last year
- ā19Updated 2 years ago
- ā36Updated last year
- ā26Updated 10 months ago
- ā58Updated this week
- ā31Updated this week
- Tools for studying developmental interpretability in neural networks.ā85Updated last month
- ā57Updated 3 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffingā40Updated 4 months ago
- ā19Updated this week
- Utilities for the HuggingFace transformers libraryā64Updated 2 years ago
- Steering vectors for transformer language models in Pytorch / Huggingfaceā88Updated last week
- ā60Updated last month
- ā52Updated 5 months ago
- Measuring the situational awareness of language modelsā34Updated last year
- (Model-written) LLM evals libraryā18Updated 2 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"ā68Updated 3 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"ā65Updated 8 months ago
- A set of Python scripts that makes your experience on TPU betterā49Updated 8 months ago