TransluceAI/docent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TransluceAI/docent)

TransluceAI / docent

☆114

Alternatives and similar repositories for docent

Users that are interested in docent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nickjiang2378 / interp-embed
View on GitHub
A toolkit for embedding text datasets with sparse autoencoders
☆30Mar 24, 2026Updated 3 months ago
TransluceAI / jailbreaking-frontier-models
View on GitHub
☆28Sep 3, 2025Updated 10 months ago
EleutherAI / delphi
View on GitHub
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆266Updated this week
METR / vivaria
View on GitHub
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆140May 18, 2026Updated 2 months ago
TransluceAI / observatory
View on GitHub
A toolkit for describing model features and intervening on those features to steer behavior.
☆247Mar 16, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ndif-team / workbench
View on GitHub
☆16Updated this week
ndif-team / nnterp
View on GitHub
Unified access to Large Language Model modules using NNsight
☆116Jul 2, 2026Updated 2 weeks ago
TransluceAI / circuits
View on GitHub
ADAG: Transluce's MLP neuron-level circuit tracing library
☆33Apr 10, 2026Updated 3 months ago
explanare / ravel
View on GitHub
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆58Oct 30, 2025Updated 8 months ago
rmovva / HypotheSAEs
View on GitHub
HypotheSAEs: hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502.04382
☆91Jul 2, 2026Updated 2 weeks ago
goodfire-ai / r1-interpretability
View on GitHub
Open source interpretability artefacts for R1.
☆183Apr 21, 2025Updated last year
curt-tigges / probity
View on GitHub
☆19Apr 10, 2025Updated last year
Phylliida / OpenClio
View on GitHub
Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use
☆82Aug 19, 2025Updated 11 months ago
brucewlee / mini-control-arena
View on GitHub
AI Control evaluation library. Built natively on Inspect AI.
☆17Feb 25, 2026Updated 4 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
redwoodresearch / rust_circuit_public
View on GitHub
☆67Feb 16, 2023Updated 3 years ago
TransluceAI / introspective-interp
View on GitHub
Repository for "Training Language Models To Explain Their Own Computations"
☆23Jul 7, 2026Updated 2 weeks ago
frankaging / Interchange-Intervention-Training
View on GitHub
The codebase for Inducing Causal Structure for Interpretable Neural Networks
☆11Dec 3, 2021Updated 4 years ago
UKGovernmentBEIS / inspect_evals
View on GitHub
Collection of evals for Inspect AI
☆592Updated this week
UKGovernmentBEIS / vllm-lens
View on GitHub
Extract residual-stream activations and apply steering vectors (including activation oracles) to any vLLM model during inference.
☆117Updated this week
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
TransformerLensOrg / CircuitsVis
View on GitHub
Mechanistic Interpretability Visualizations using React
☆358Apr 30, 2026Updated 2 months ago
jiahai-feng / binding-iclr
View on GitHub
☆19Mar 5, 2024Updated 2 years ago
ndif-team / nnsight
View on GitHub
The nnsight package enables interpreting and manipulating the internals of deep learned models.
☆995Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
science-of-finetuning / diffing-toolkit
View on GitHub
A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.
☆78Updated this week
safety-research / auditing-agents
View on GitHub
☆27Jul 1, 2026Updated 3 weeks ago
openai / chz
View on GitHub
☆235Nov 24, 2025Updated 7 months ago
scaleapi / mrt
View on GitHub
https://scale.com/research/mrt
☆20Mar 16, 2026Updated 4 months ago
ndif-team / ndif
View on GitHub
The NDIF server, which performs deep inference and serves nnsight requests remotely
☆50Updated this week
anthropic-experimental / automated-auditing
View on GitHub
Prompts used in the Automated Auditing Blog Post
☆166Jul 24, 2025Updated 11 months ago
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
xAlg-ai / HashAttention-1.0
View on GitHub
☆18Sep 23, 2025Updated 9 months ago
meridianlabs-ai / inspect_petri
View on GitHub
An alignment auditing agent capable of quickly exploring alignment hypothesis
☆1,263Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
meridianlabs-ai / inspect_scout
View on GitHub
In-depth analysis of AI agent transcripts.
☆57Updated this week
maiush / OpenCharacterTraining
View on GitHub
Open Character Training
☆92Apr 4, 2026Updated 3 months ago
goodfire-ai / scribe
View on GitHub
☆85Feb 18, 2026Updated 5 months ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
RobertCsordas / llm_effective_depth
View on GitHub
Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29Jun 25, 2025Updated last year
lisadunlap / VibeCheck
View on GitHub
Automated Qualitative Analysis of LLMs (ICLR 2025)
☆53Jul 6, 2025Updated last year
METR / hcast-public
View on GitHub
☆22Jul 6, 2026Updated 2 weeks ago