wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆25 Updated 3 months ago
Related projects:
- ☆48 Updated 3 months ago
- ☆54 Updated last week
- ☆33 Updated 3 months ago
- ☆75 Updated this week
- ☆68 Updated 7 months ago
- ☆47 Updated 3 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆57 Updated last week
- ☆44 Updated 11 months ago
- ☆23 Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆14 Updated 8 months ago
- ☆68 Updated 3 weeks ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆23 Updated 3 months ago
- A library for efficient patching and automatic circuit discovery. ☆18 Updated 3 weeks ago
- Sparse and discrete interpretability tool for neural networks ☆51 Updated 7 months ago
- ☆42 Updated 3 months ago
- ☆46 Updated 7 months ago
- ☆64 Updated last month
- Sparse Autoencoder Training Library ☆18 Updated last month
- ☆12 Updated 8 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆57 Updated 10 months ago
- ☆66 Updated last month
- ☆50 Updated last month
- Algebraic value editing in pretrained language models ☆54 Updated 10 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆34 Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆33 Updated last month
- Evaluation of neuro-symbolic engines ☆29 Updated last month
- A MAD laboratory to improve AI architecture designs 🧪 ☆84 Updated 4 months ago
- Sparse probing paper full code. ☆47 Updated 9 months ago
- ☆42 Updated 7 months ago
- ☆29 Updated 10 months ago