A collection of lightweight interpretability scripts to understand how LLMs think
☆89 · Mar 18, 2026 · Updated last week
Alternatives and similar repositories for llm-interp
Users who are interested in llm-interp are comparing it to the libraries listed below.
- ☆25 · Feb 18, 2026 · Updated last month
- ☆43 · Jan 27, 2026 · Updated 2 months ago
- KV Cache & LoRA for minGPT ☆60 · Mar 4, 2026 · Updated 3 weeks ago
- A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability ☆30 · Jan 30, 2025 · Updated last year
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510) ☆18 · Dec 1, 2023 · Updated 2 years ago
- Tiny AI model embedded in NES ROMs to generate character names in-game. ☆30 · Sep 28, 2025 · Updated 6 months ago
- Code for implementing central flows ☆44 · Sep 5, 2025 · Updated 6 months ago
- Lean proof that a normed vector space with compact unit ball is finite dimensional ☆11 · Dec 7, 2019 · Updated 6 years ago
- An introduction to DSPy ☆34 · Aug 30, 2025 · Updated 6 months ago
- My website ☆13 · Oct 18, 2025 · Updated 5 months ago
- Approximating the joint distribution of language models via MCTS ☆22 · Nov 3, 2024 · Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 · Jan 16, 2023 · Updated 3 years ago
- A boilerplate web app using axum, htmx, and tera (for templating). Demonstrates how these technologies can be used in tandem. ☆15 · Sep 7, 2023 · Updated 2 years ago
- ☆22 · Sep 16, 2025 · Updated 6 months ago
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti… ☆45 · Updated this week
- A simple REPL for Lean 4, returning information about errors and sorries. ☆12 · Jun 19, 2023 · Updated 2 years ago
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks" ☆39 · Apr 24, 2025 · Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆23 · Jun 25, 2024 · Updated last year
- a WIP architecture designed to allow transformers to think in a manner without tokens ☆20 · Apr 12, 2024 · Updated last year
- Rigorous computation of the endomorphism ring of a Jacobian ☆11 · Jul 31, 2025 · Updated 7 months ago
- ☆28 · Sep 19, 2025 · Updated 6 months ago
- Pass loop info to LLVM ☆21 · Sep 4, 2023 · Updated 2 years ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models ☆32 · Nov 4, 2025 · Updated 4 months ago
- DINO-based perceptual losses and FDD feature extraction ☆26 · Jan 7, 2026 · Updated 2 months ago
- ☆60 · Feb 6, 2026 · Updated last month
- 🔬 Visualize attention layers from Stable Diffusion ☆92 · Apr 1, 2025 · Updated 11 months ago
- A collection of some awesome public Julia programming language projects. ☆21 · Feb 22, 2024 · Updated 2 years ago
- 6.790 | Machine Learning | Draft Site/Notes ☆15 · Dec 5, 2025 · Updated 3 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 5 months ago
- Simple GRPO scripts and configurations. ☆59 · Feb 6, 2025 · Updated last year
- ☆38 · Dec 18, 2025 · Updated 3 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT ☆129 · Jan 30, 2026 · Updated last month
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆91 · Jul 17, 2025 · Updated 8 months ago
- ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory ☆58 · Nov 27, 2025 · Updated 4 months ago
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation ☆16 · Jun 25, 2025 · Updated 9 months ago
- LLAMA Turboquant implementation with CUDA support ☆224 · Updated this week
- Lego for GRPO ☆30 · May 27, 2025 · Updated 10 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 · Oct 29, 2025 · Updated 5 months ago