anthropics/PySvelte

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anthropics/PySvelte)

anthropics / PySvelte

A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations

☆205

Alternatives and similar repositories for PySvelte

Users that are interested in PySvelte are comparing it to the libraries listed below

Sorting:

TomFrederik / unseal
View on GitHub
Mechanistic Interpretability for Transformer Models
☆53Jun 1, 2022Updated 3 years ago
Mech-Interp / PySvelte
View on GitHub
A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations
☆15Apr 15, 2024Updated last year
TransformerLensOrg / CircuitsVis
View on GitHub
Mechanistic Interpretability Visualizations using React
☆326Dec 18, 2024Updated last year
redwoodresearch / interp
View on GitHub
Redwood Research's transformer interpretability tools
☆15Apr 15, 2022Updated 3 years ago
ArthurConmy / Automatic-Circuit-Discovery
View on GitHub
☆271Oct 1, 2024Updated last year
anthropics / toy-models-of-superposition
View on GitHub
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆137Sep 14, 2022Updated 3 years ago
TransformerLensOrg / TransformerLens
View on GitHub
A library for mechanistic interpretability of GPT-style language models
☆3,112Updated this week
EleutherAI / mdl
View on GitHub
Minimum Description Length probing for neural network representations
☆20Jan 28, 2025Updated last year
anthropics / evals
View on GitHub
☆330Jul 2, 2024Updated last year
GDPlumb / ExpO
View on GitHub
Explanation Optimization
☆13Oct 16, 2020Updated 5 years ago
ai-safety-foundation / sparse_autoencoder
View on GitHub
Sparse Autoencoder for Mechanistic Interpretability
☆292Jul 20, 2024Updated last year
callummcdougall / sae_vis
View on GitHub
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆247Updated this week
redwoodresearch / mlab
View on GitHub
Machine Learning for Alignment Bootcamp
☆82Apr 27, 2022Updated 3 years ago
Prisma-Multimodal / ViT-Prisma
View on GitHub
ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).
☆340Jul 23, 2025Updated 7 months ago
AndreasMadsen / nlp-roar-interpretability
View on GitHub
Measuring if attention is explanation with ROAR
☆22Mar 3, 2023Updated 2 years ago
openai / automated-interpretability
View on GitHub
☆1,072Mar 6, 2024Updated last year
likenneth / othello_world
View on GitHub
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆202Jul 12, 2023Updated 2 years ago
neelnanda-io / 1L-Sparse-Autoencoder
View on GitHub
☆134Oct 28, 2023Updated 2 years ago
neelnanda-io / Neuroscope
View on GitHub
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆13Feb 13, 2023Updated 3 years ago
quantified-uncertainty / ai-safety-papers
View on GitHub
☆22Sep 9, 2021Updated 4 years ago
HoagyC / sparse_coding
View on GitHub
Using sparse coding to find distributed representations used by neural networks.
☆297Nov 10, 2023Updated 2 years ago
LoryPack / LLM-LieDetector
View on GitHub
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71Jun 19, 2024Updated last year
anthropics / sycophancy-to-subterfuge-paper
View on GitHub
☆25Sep 5, 2024Updated last year
EleutherAI / features-across-time
View on GitHub
Understanding how features learned by neural networks evolve throughout training
☆41Oct 24, 2024Updated last year
yizhe-ang / interactive-transformer
View on GitHub
A visual interface for understanding and interpreting Transformers
☆78Oct 21, 2023Updated 2 years ago
alan-cooney / transformer-lens-starter-template
View on GitHub
A quick way to get started with Transformer Lens
☆14Dec 13, 2023Updated 2 years ago
decoderesearch / SAELens
View on GitHub
Training Sparse Autoencoders on Language Models
☆1,219Updated this week
callummcdougall / ARENA_2.0
View on GitHub
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆238Aug 11, 2025Updated 6 months ago
saprmarks / dictionary_learning
View on GitHub
☆396Aug 21, 2025Updated 6 months ago
guy-dar / embedding-space
View on GitHub
☆57Jun 15, 2023Updated 2 years ago
frankaging / Causal-Distill
View on GitHub
The Codebase for Causal Distillation for Language Models (NAACL '22)
☆26May 1, 2022Updated 3 years ago
YalaLab / atlas
View on GitHub
☆18Mar 19, 2025Updated 11 months ago
jjbrophy47 / tree_influence
View on GitHub
Influence Estimation for Gradient-Boosted Decision Trees
☆29May 27, 2024Updated last year
EleutherAGI / summarisation
View on GitHub
The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…
☆12Jul 14, 2021Updated 4 years ago
ericwtodd / function_vectors
View on GitHub
Function Vectors in Large Language Models (ICLR 2024)
☆192Apr 17, 2025Updated 10 months ago
anthropics / attribution-graphs-frontend
View on GitHub
https://transformer-circuits.pub/2025/attribution-graphs/methods.html
☆91Mar 27, 2025Updated 11 months ago
wesg52 / sparse-probing-paper
View on GitHub
Sparse probing paper full code.
☆67Dec 17, 2023Updated 2 years ago
saprmarks / feature-circuits
View on GitHub
☆209Oct 14, 2025Updated 4 months ago
collin-burns / discovering_latent_knowledge
View on GitHub
☆284Mar 2, 2024Updated last year