delphi-suite / delphiLinks
small language models training made easy
☆13Updated 10 months ago
Alternatives and similar repositories for delphi
Users that are interested in delphi are comparing it to the libraries listed below
Sorting:
- Tools for studying developmental interpretability in neural networks.☆105Updated 3 months ago
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- ☆189Updated 3 months ago
- ☆19Updated 6 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆216Updated last week
- Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated 11 months ago
- ☆54Updated 10 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆221Updated 10 months ago
- Modified to support crosscoder training.☆23Updated last week
- 🧠 Starter templates for doing interpretability research☆75Updated 2 years ago
- ☆348Updated last month
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆24Updated 10 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Updated 8 months ago
- Universal Neurons in GPT2 Language Models☆30Updated last year
- Sparse Autoencoder Training Library☆55Updated 5 months ago
- TransformerLens + HuggingFace☆11Updated last year
- Mechanistic Interpretability Visualizations using React☆291Updated 10 months ago
- ☆29Updated last year
- Sparse Autoencoder for Mechanistic Interpretability☆272Updated last year
- ☆131Updated this week
- ☆244Updated last year
- ☆27Updated 2 years ago
- ☆81Updated 7 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆31Updated last year
- Unified access to Large Language Model modules using NNsight☆49Updated last week
- ☆128Updated last year
- ☆23Updated 10 months ago
- Tools for optimizing steering vectors in LLMs.☆13Updated 6 months ago
- ☆36Updated last year
- ☆106Updated 8 months ago