ajobi-uhc / seer
seer is designed for interpretability researchers who want to do research on or with interp agents. It adds quality-of-life improvements and fixes some of the annoyances of using Claude Code out of the box.
☆101 · Updated 3 weeks ago
Alternatives and similar repositories for seer
Users interested in seer are comparing it to the libraries listed below.
- Unified access to Large Language Model modules using NNsight ☆71 · Updated last week
- Mechanistic Interpretability Visualizations using React ☆307 · Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆235 · Updated 2 weeks ago
- Inference API for many LLMs and other useful tools for empirical research ☆91 · Updated 2 weeks ago
- Open source interpretability artefacts for R1. ☆165 · Updated 8 months ago
- ☆58 · Updated last year
- ☆83 · Updated 10 months ago
- ☆193 · Updated last year
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆49 · Updated this week
- ☆262 · Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆63 · Updated last year
- ControlArena is a collection of settings, model organisms, and protocols for running control experiments. ☆145 · Updated 3 weeks ago
- ☆132 · Updated 2 years ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆236 · Updated 5 months ago
- ☆77 · Updated 3 weeks ago
- Attribution-based Parameter Decomposition ☆33 · Updated 7 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆236 · Updated last year
- ☆380 · Updated 4 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University ☆279 · Updated 3 weeks ago
- Sparse Autoencoder for Mechanistic Interpretability ☆285 · Updated last year
- ☆227 · Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior. ☆225 · Updated 3 weeks ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 · Updated 8 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface ☆137 · Updated 10 months ago
- ☆202 · Updated 2 months ago
- ☆20 · Updated 9 months ago
- Sparse Autoencoder Training Library ☆56 · Updated 8 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆389 · Updated last year
- ☆82 · Updated 3 months ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models. ☆758 · Updated this week
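
Since several of the libraries above (including the last entry, nnsight) are accessed through a tracing-style Python API, here is a minimal sketch of the kind of usage they enable. The model name, layer index, and prompt are illustrative assumptions, and exact proxy semantics vary by nnsight version (older releases require `.value` to read a saved proxy), so treat this as a sketch rather than a definitive example.

```python
# Minimal sketch: inspect a hidden layer of GPT-2 with nnsight's tracing interface.
# Assumptions: nnsight installed, GPT-2 weights downloadable, layer 5 chosen arbitrarily.
from nnsight import LanguageModel

# Wrap a Hugging Face model so its internals can be traced.
model = LanguageModel("openai-community/gpt2", device_map="auto")

with model.trace("The Eiffel Tower is in the city of"):
    # Save the hidden states output by transformer block 5 and the final logits.
    hidden = model.transformer.h[5].output[0].save()
    logits = model.lm_head.output.save()

# After the trace exits, the saved proxies hold concrete tensors
# (on older nnsight versions, access them via hidden.value / logits.value).
print(hidden.shape, logits.shape)
```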