A collection of lightweight interpretability scripts to understand how LLMs think
☆89 · Mar 18, 2026 · Updated last month
Alternatives and similar repositories for llm-interp
Users who are interested in llm-interp are comparing it to the libraries listed below.
- ☆10 · Nov 18, 2024 · Updated last year
- KV Cache & LoRA for minGPT ☆62 · Mar 4, 2026 · Updated last month
- A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability ☆30 · Jan 30, 2025 · Updated last year
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Code for the book Deep Learning with PyTorch by Howard Huang, Eli Stevens, Luca Antiga, and Thomas Viehmann. ☆50 · Mar 28, 2026 · Updated 3 weeks ago
- ☆30 · Feb 18, 2026 · Updated 2 months ago
- ☆49 · May 27, 2025 · Updated 10 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ☆52 · Mar 2, 2026 · Updated last month
- An introduction to DSPy ☆34 · Aug 30, 2025 · Updated 7 months ago
- Approximating the joint distribution of language models via MCTS ☆22 · Nov 3, 2024 · Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 · Jan 16, 2023 · Updated 3 years ago
- A boilerplate web app using axum, htmx, and tera (for templating). Demonstrates how these technologies can be used in tandem. ☆15 · Sep 7, 2023 · Updated 2 years ago
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti… ☆45 · Apr 9, 2026 · Updated last week
- ☆28 · Sep 19, 2025 · Updated 7 months ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models ☆33 · Nov 4, 2025 · Updated 5 months ago
- DINO-based perceptual losses and FDD feature extraction ☆26 · Jan 7, 2026 · Updated 3 months ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond". ☆20 · Nov 17, 2025 · Updated 5 months ago
- The old version of https://internet.dev ☆22 · Jan 22, 2025 · Updated last year
- a catch-all repo ☆11 · Dec 28, 2023 · Updated 2 years ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆14 · Dec 16, 2024 · Updated last year
- ☆19 · May 19, 2025 · Updated 11 months ago
- ☆62 · Feb 6, 2026 · Updated 2 months ago
- 🔬 Visualize attention layers from Stable Diffusion ☆92 · Apr 1, 2025 · Updated last year
- A collection of some awesome public Julia programming language projects. ☆22 · Feb 22, 2024 · Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 6 months ago
- Code for experiments on transformers using Markovian data. ☆22 · Nov 22, 2024 · Updated last year
- Simple GRPO scripts and configurations. ☆59 · Feb 6, 2025 · Updated last year
- ☆13 · Apr 10, 2025 · Updated last year
- ☆11 · Feb 22, 2025 · Updated last year
- Pure C implementation of Voxtral-4B-TTS-2603 ☆86 · Mar 27, 2026 · Updated 3 weeks ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings ☆11 · Feb 24, 2025 · Updated last year
- ☆38 · Dec 18, 2025 · Updated 4 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT ☆132 · Jan 30, 2026 · Updated 2 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆92 · Jul 17, 2025 · Updated 9 months ago
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation ☆16 · Jun 25, 2025 · Updated 9 months ago
- Lego for GRPO ☆30 · May 27, 2025 · Updated 10 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models". ☆54 · Dec 28, 2025 · Updated 3 months ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Exploring Applications of GRPO ☆252 · Aug 25, 2025 · Updated 7 months ago