The Github repo for our survey paper: "Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models"
☆100Jan 30, 2026Updated last month
Alternatives and similar repositories for Awesome-Actionable-MI-Survey
Users that are interested in Awesome-Actionable-MI-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving☆59Feb 28, 2026Updated 3 weeks ago
- A resource repository for representation engineering in large language models☆149Nov 14, 2024Updated last year
- The official implement of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆14Aug 11, 2020Updated 5 years ago
- ☆33Feb 15, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 9 months ago
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆77Jan 16, 2026Updated 2 months ago
- Simple program to chat in Java☆21Apr 27, 2020Updated 5 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆62Mar 30, 2024Updated last year
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆17May 17, 2025Updated 10 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 6 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated 11 months ago
- Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.☆25Mar 27, 2025Updated 11 months ago
- ☆29Jul 24, 2025Updated 8 months ago
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"☆16Oct 20, 2025Updated 5 months ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 4 months ago
- [AAAI-25] Official repository of "Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object De…☆20Dec 27, 2024Updated last year
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆15Jul 2, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆18Jul 3, 2024Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated last year
- The source code of Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction☆10May 6, 2022Updated 3 years ago
- awesome papers in LLM interpretability☆611Aug 20, 2025Updated 7 months ago
- Python class for training models on imbalanced data using bagging over balanced samples.☆10Oct 20, 2016Updated 9 years ago
- ☆18Jun 20, 2025Updated 9 months ago
- ☆16Sep 1, 2025Updated 6 months ago
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆34Mar 18, 2026Updated last week
- This is the open-source code for TokenCarve.☆26Jan 23, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Paper list of Video LLM hallucination. Welcome to Star and Contribute!☆23Mar 6, 2026Updated 2 weeks ago
- [ICCV 2025] Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction☆26Oct 27, 2025Updated 4 months ago
- ☆20Jul 16, 2024Updated last year
- ☆92Dec 23, 2024Updated last year
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆194Mar 4, 2026Updated 3 weeks ago
- 🖋 Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024☆21Feb 17, 2026Updated last month
- ☆21Jun 27, 2024Updated last year