RiddleHe/llm-interp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RiddleHe/llm-interp)

RiddleHe / llm-interp

A collection of lightweight interpretability scripts to understand how LLMs think

☆90

Alternatives and similar repositories for llm-interp

Users that are interested in llm-interp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TruthfulAI-research / negation_neglect
View on GitHub
Code for Negation Neglect
☆16May 22, 2026Updated last month
main-horse / hnet-old
View on GitHub
H-Net Dynamic Hierarchical Architecture
☆81Sep 11, 2025Updated 10 months ago
TransluceAI / circuits
View on GitHub
ADAG: Transluce's MLP neuron-level circuit tracing library
☆33Apr 10, 2026Updated 3 months ago
idoatad / TensorLens
View on GitHub
Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]
☆47Apr 14, 2026Updated 3 months ago
namiyousef / argument-mining
View on GitHub
Repository for NLP project. Name to be changed when we decide on a project
☆16Apr 19, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dnakov / hrm-mlx
View on GitHub
MLX implementation of Hierarchical Reasoning Model (HRM) - Adaptive computation for complex reasoning tasks
☆29Aug 27, 2025Updated 10 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
Da1sypetals / SnapViewer
View on GitHub
PyTorch memory allocation visualizer
☆76Mar 6, 2026Updated 4 months ago
YibooZhao / cogvideox_vis_attention
View on GitHub
☆10Nov 18, 2024Updated last year
Laurian / context-compression-experiments-2508
View on GitHub
prompt engineering experiments with DSPy GEPA and TextGrad
☆70Sep 2, 2025Updated 10 months ago
safety-research / introspection-adapters
View on GitHub
Training LLMs to Report Their Learned Behaviors
☆27Apr 28, 2026Updated 2 months ago
rankdim / torus
View on GitHub
Game of Life on a Toroidal Surface
☆16Aug 5, 2025Updated 11 months ago
ESHyperscale / nano-egg
View on GitHub
Evolution Pretraining Fully in Int Formats
☆177Feb 25, 2026Updated 4 months ago
thepowerfuldeez / sample_efficient_gpt
View on GitHub
Training framework with a goal to explore the frontier of sample efficiency of small language models
☆101Jan 25, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
akshat57 / how-do-llms-use-their-depth
View on GitHub
☆19Nov 24, 2025Updated 7 months ago
OpenRewardAI / openreward-cookbook
View on GitHub
Training and evaluating with OpenReward
☆33Apr 28, 2026Updated 2 months ago
cococry / Lantern
View on GitHub
Stack based programming language in C
☆21Apr 12, 2023Updated 3 years ago
IST-DASLab / gptq-gguf-toolkit
View on GitHub
Efficient non-uniform quantization with GPTQ for GGUF
☆64Sep 17, 2025Updated 10 months ago
reka-ai / rekaquant
View on GitHub
☆63Jul 10, 2025Updated last year
eth-easl / mixtera
View on GitHub
A lightweight, user-friendly data-plane for LLM training.
☆40Sep 10, 2025Updated 10 months ago
santiagomed / x-customized-feed
View on GitHub
X Developer Challenge
☆12Apr 25, 2024Updated 2 years ago
katiekang1998 / reasoning_generalization
View on GitHub
☆33Jan 7, 2025Updated last year
TransluceAI / introspective-interp
View on GitHub
Repository for "Training Language Models To Explain Their Own Computations"
☆23Jul 7, 2026Updated 2 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yaof20 / DenseMixer
View on GitHub
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
☆67Aug 3, 2025Updated 11 months ago
cavaunpeu / mcts-llm-codegen
View on GitHub
A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)
☆17Dec 1, 2023Updated 2 years ago
tilde-research / activault
View on GitHub
Engine for collecting, uploading, and downloading model activations
☆30Apr 2, 2025Updated last year
zlab-princeton / llm-pruning-collection
View on GitHub
A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.
☆69Apr 20, 2026Updated 3 months ago
joey00072 / Attention-as-graph
View on GitHub
alternative way to calculating self attention
☆18May 25, 2024Updated 2 years ago
goodfire-ai / r1-interpretability
View on GitHub
Open source interpretability artefacts for R1.
☆183Apr 21, 2025Updated last year
nickjiang2378 / interp-embed
View on GitHub
A toolkit for embedding text datasets with sparse autoencoders
☆30Mar 24, 2026Updated 3 months ago
WujiangXu / MemGym
View on GitHub
The code for paper "MemGym: a Long-Horizon Memory Environment for LLM Agents".
☆18Jun 2, 2026Updated last month
SinatrasC / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆17Oct 9, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
dropbox / low-rank-llama2
View on GitHub
Low-Rank Llama Custom Training
☆23Mar 27, 2024Updated 2 years ago
doomslide / attention-graph
View on GitHub
A graph visualization of attention
☆56May 20, 2025Updated last year
lacoco-lab / decompiling_transformers
View on GitHub
Repo for Paper: Discovering Interpretable Algorithms by Decompiling Transformers to RASP
☆15May 25, 2026Updated last month
technion-cs-nlp / vlm-circuits-analysis
View on GitHub
Code for the experiments and websites of the paper "Same Task, Different Circuits"
☆36Jun 9, 2026Updated last month
dataflowr / gpu_llm_flash-attention
View on GitHub
Course on Flash-attention in Triton
☆100Feb 9, 2026Updated 5 months ago
FlashSampling / FlashSampling
View on GitHub
FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)
☆76Jun 15, 2026Updated last month
CMU-AIRe / POPE
View on GitHub
☆27Jan 31, 2026Updated 5 months ago