☆83 Updated this week
Alternatives and similar repositories for docent
Users who are interested in docent are comparing it to the libraries listed below.
- ☆36 Jul 4, 2025 Updated 7 months ago
- ☆25 Sep 3, 2025 Updated 5 months ago
- The codebase for Inducing Causal Structure for Interpretable Neural Networks ☆11 Dec 3, 2021 Updated 4 years ago
- Unified access to Large Language Model modules using NNsight ☆93 Updated this week
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆62 Feb 22, 2026 Updated last week
- ☆17 Jul 9, 2025 Updated 7 months ago
- ☆19 Sep 16, 2025 Updated 5 months ago
- https://footprints.baulab.info ☆17 Oct 4, 2024 Updated last year
- Engine for collecting, uploading, and downloading model activations ☆26 Apr 2, 2025 Updated 11 months ago
- ☆74 Feb 18, 2026 Updated last week
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 May 3, 2022 Updated 3 years ago
- ☆20 Apr 10, 2025 Updated 10 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 Jan 19, 2025 Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆57 Oct 30, 2025 Updated 4 months ago
- A toolkit for describing model features and intervening on those features to steer behavior. ☆230 Dec 12, 2025 Updated 2 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆243 Feb 23, 2026 Updated last week
- Mapping out the "memory" of neural nets with data attribution ☆45 Updated this week
- Modified to support crosscoder training. ☆25 Feb 4, 2026 Updated 3 weeks ago
- ☆27 Oct 22, 2024 Updated last year
- The nnsight package enables interpreting and manipulating the internals of deep learned models. ☆825 Updated this week
- ☆66 Feb 16, 2023 Updated 3 years ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 Apr 22, 2025 Updated 10 months ago
- Auditing agents for fine-tuning safety ☆20 Oct 21, 2025 Updated 4 months ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely ☆42 Updated this week
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 May 30, 2025 Updated 9 months ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆36 Jun 8, 2023 Updated 2 years ago
- ☆52 Oct 23, 2023 Updated 2 years ago
- Open source interpretability artefacts for R1. ☆171 Apr 21, 2025 Updated 10 months ago
- A Blackjack game with GUI written in Java. ☆11 Nov 21, 2018 Updated 7 years ago
- A framework for evaluating Machine Translation models. ☆12 May 26, 2025 Updated 9 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 Jan 1, 2025 Updated last year
- ☆14 Apr 29, 2025 Updated 10 months ago
- Simple repository for training small reasoning models ☆49 Feb 17, 2026 Updated last week
- A framework for few-shot evaluation of autoregressive language models. ☆12 Jul 14, 2025 Updated 7 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 Dec 12, 2024 Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities ☆14 Feb 24, 2025 Updated last year
- ☆12 Jan 11, 2026 Updated last month
- A Swedish Natural Language Understanding Benchmark ☆11 Dec 12, 2025 Updated 2 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 Jan 11, 2024 Updated 2 years ago