oclivegriffin/crosscode


A library for training crosscoders

☆17

Alternatives and similar repositories for crosscode

Users who are interested in crosscode are comparing it to the libraries listed below.

curt-tigges / crosslayer-coding
☆18 · Jul 9, 2025 · Updated last year
EleutherAI / attribute
☆16 · Nov 14, 2025 · Updated 8 months ago
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆68 · Oct 27, 2024 · Updated last year
ArthurConmy / MishformerLens
MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…
☆10 · Oct 7, 2024 · Updated last year
goodfire-ai / param-decomp
Parameter Decomposition
☆133 · Updated this week
adamkarvonen / dictionary_learning_demo
☆26 · Aug 23, 2025 · Updated 10 months ago
TruthfulAI-research / negation_neglect
Code for Negation Neglect
☆16 · May 22, 2026 · Updated 2 months ago
curt-tigges / probity
☆19 · Apr 10, 2025 · Updated last year
goodfire-ai / scribe
☆85 · Feb 18, 2026 · Updated 5 months ago
koayon / atp_star
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20 · Jan 19, 2025 · Updated last year
ApolloResearch / apd
Attribution-based Parameter Decomposition
☆35 · Jun 11, 2025 · Updated last year
jacobdunefsky / llm-steering-opt
Tools for optimizing steering vectors in LLMs.
☆22 · Apr 10, 2025 · Updated last year
jammastergirish / LLMProbe
☆20 · Dec 10, 2025 · Updated 7 months ago
Butanium / tiny-activation-dashboard
A tiny easily hackable implementation of a feature dashboard.
☆17 · Oct 21, 2025 · Updated 9 months ago
saprmarks / dictionary_learning
☆427 · Aug 21, 2025 · Updated 11 months ago
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆266 · Updated this week
ndif-team / nnterp
Unified access to Large Language Model modules using NNsight
☆116 · Jul 2, 2026 · Updated 2 weeks ago
harish-kamath / rqae
Residual Quantization Autoencoder, used for interpreting LLMs
☆14 · Jan 1, 2025 · Updated last year
joysatisficer / chapter2
☆19 · Mar 17, 2026 · Updated 4 months ago
yash-srivastava19 / arrakis
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
☆31 · Jul 8, 2026 · Updated last week
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆58 · May 1, 2025 · Updated last year
Psi-Prod / ppx_system
ppx_system is a syntax extension to known operating system at compile time
☆12 · May 9, 2023 · Updated 3 years ago
neelnanda-io / Crosscoders
☆60 · Nov 19, 2024 · Updated last year
decoderesearch / automated-interpretability
☆24 · Feb 13, 2026 · Updated 5 months ago
HugoFry / mats_sae_training_for_ViTs
☆25 · Apr 23, 2024 · Updated 2 years ago
clarifying-EM / model-organisms-for-EM
Code repo for the model organisms and convergent directions of EM papers.
☆72 · Sep 22, 2025 · Updated 9 months ago
thejaminator / latteries
James' cookbook of evaluations and finetuning experiments
☆32 · Feb 19, 2026 · Updated 5 months ago
ajobi-uhc / seer
This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …
☆146 · Feb 8, 2026 · Updated 5 months ago
EleutherAI / clt-training
Sparsify transformers with cross-layer transcoders
☆26 · Nov 14, 2025 · Updated 8 months ago
uzaymacar / blackjack-with-gui
A Blackjack game with GUI written in Java.
☆11 · Nov 21, 2018 · Updated 7 years ago
UKGovernmentBEIS / vllm-lens
Extract residual-stream activations and apply steering vectors (including activation oracles) to any vLLM model during inference.
☆117 · Updated this week
c-cube / ocaml-avro
[DEPRECATED (use avro-simple)] Runtime library and schema compiler for the Avro serialization format.
☆21 · Jul 7, 2026 · Updated 2 weeks ago
seanjhardy / HyperLife
A realtime multicellular organism evolution simulator with Verlet integration
☆12 · May 30, 2021 · Updated 5 years ago
robostac / coders-strike-back-referee
Brutaltester compatible referee for coders strike back
☆13 · Jun 1, 2026 · Updated last month
lafeychine / scala-native-sfml
Scala Native 3 bindings for SFML library
☆15 · Jul 9, 2023 · Updated 3 years ago
science-of-finetuning / sparsity-artifacts-crosscoders
Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.
☆17 · Jul 6, 2026 · Updated 2 weeks ago
jbkjr / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14 · May 17, 2024 · Updated 2 years ago
chanind / linear-relational
Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch
☆11 · Aug 7, 2024 · Updated last year
uzaymacar / self-supervision
Implementations of several self-supervised pretext tasks for language and vision modalities in PyTorch.
☆13 · Jan 19, 2021 · Updated 5 years ago