☆79Feb 18, 2026Updated last month
Alternatives and similar repositories for scribe
Users that are interested in scribe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for training crosscoders☆16May 28, 2025Updated 9 months ago
- ☆20Apr 10, 2025Updated 11 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆69Updated this week
- A python sdk for LLM finetuning and inference on runpod infrastructure☆22Mar 16, 2026Updated last week
- Auditing agents for fine-tuning safety☆20Oct 21, 2025Updated 5 months ago
- ☆17Jul 9, 2025Updated 8 months ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆863Updated this week
- Code and materials for "Weird Generalization and Inductive Backdoors"☆37Jan 11, 2026Updated 2 months ago
- AlgZoo: uninterpreted models with fewer than 1,500 parameters☆45Jan 19, 2026Updated 2 months ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 4 months ago
- Chaikin's smoothing algorithm extended to a multidimensional library☆14Dec 5, 2024Updated last year
- ☆39Jul 4, 2025Updated 8 months ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆18Apr 15, 2025Updated 11 months ago
- ☆87Updated this week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆245Mar 16, 2026Updated last week
- Modified to support crosscoder training.☆25Feb 4, 2026Updated last month
- A TinyStories LM with SAEs and transcoders☆14Apr 3, 2025Updated 11 months ago
- Mechanistic Interpretability Visualizations using React☆331Dec 18, 2024Updated last year
- ☆403Aug 21, 2025Updated 7 months ago
- Applying SAEs for fine-grained control☆26Dec 15, 2024Updated last year
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions☆14Mar 7, 2026Updated 2 weeks ago
- ☆48May 27, 2025Updated 9 months ago
- Dimensionality-reduction and classification for morphology and electrophysiology☆12Sep 23, 2024Updated last year
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Oct 27, 2024Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- ☆24Aug 23, 2025Updated 7 months ago
- Prompts used in the Automated Auditing Blog Post☆143Jul 24, 2025Updated 7 months ago
- ppx_system is a syntax extension to known operating system at compile time☆12May 9, 2023Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- ☆76Jul 24, 2025Updated 7 months ago
- ☆83Feb 25, 2025Updated last year
- Inference API for many LLMs and other useful tools for empirical research☆111Mar 11, 2026Updated last week
- This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …☆134Feb 8, 2026Updated last month
- ☆24Feb 23, 2026Updated last month
- ☆27Oct 6, 2024Updated last year
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- Mesoscale activity ephys ingest schema☆11Jul 18, 2023Updated 2 years ago
- Functional Vector Graphics☆17Jun 19, 2017Updated 8 years ago