A collection of lightweight interpretability scripts to understand how LLMs think
☆91Mar 18, 2026Updated 2 months ago
Alternatives and similar repositories for llm-interp
Users that are interested in llm-interp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- ☆10Nov 18, 2024Updated last year
- KV Cache & LoRA for minGPT☆63Mar 4, 2026Updated 3 months ago
- A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability☆32Jan 30, 2025Updated last year
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆49May 27, 2025Updated last year
- Rethinking the Trust Region in LLM Reinforcement Learning☆55Mar 2, 2026Updated 3 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- A boilerplate web app using axum, htmx, and tera (for templating). Demonstrates how these technologies can be used in tandem.☆15Sep 7, 2023Updated 2 years ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆12Jun 19, 2023Updated 2 years ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated last year
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated last year
- ☆24Jan 22, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- ☆15Apr 1, 2026Updated 2 months ago
- ☆30Sep 19, 2025Updated 8 months ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆33Nov 4, 2025Updated 7 months ago
- List of new Project Fundraising Opportunities for NumFOCUS Sponsored Projects☆12May 14, 2026Updated 3 weeks ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 6 months ago
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- Efficient non-uniform quantization with GPTQ for GGUF☆62Sep 17, 2025Updated 8 months ago
- ☆19May 19, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.☆123Apr 25, 2026Updated last month
- A collection of some awesome public Julia programming language projects.☆23Feb 22, 2024Updated 2 years ago
- 6.790 | Machine Learning | Draft Site/Notes☆15Dec 5, 2025Updated 6 months ago
- DINO-based perceptual losses and FDD feature extraction☆31Jan 7, 2026Updated 5 months ago
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- ☆13Apr 10, 2025Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- ☆39Dec 18, 2025Updated 5 months ago
- ☆16May 8, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆92Jul 17, 2025Updated 10 months ago
- tensorrt部署教程☆11Aug 1, 2025Updated 10 months ago
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation☆16Jun 25, 2025Updated 11 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆58Dec 28, 2025Updated 5 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆134Jan 30, 2026Updated 4 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Exploring Applications of GRPO☆253Aug 25, 2025Updated 9 months ago