The Github repo for our survey paper: "Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models"
☆124Apr 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Actionable-MI-Survey
Users that are interested in Awesome-Actionable-MI-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A resource repository for representation engineering in large language models☆151Nov 14, 2024Updated last year
- The official implement of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving☆66Feb 28, 2026Updated 2 months ago
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- 🚀 First survey on Attention Sink in Transformers — 180+ papers on utilization, interpretation, and mitigation.☆73Apr 16, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆35Feb 15, 2026Updated 2 months ago
- ☆41Jan 30, 2026Updated 3 months ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 10 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 4 months ago
- Simple program to chat in Java☆21Apr 27, 2020Updated 6 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆62Mar 30, 2024Updated 2 years ago
- ☆21Oct 2, 2024Updated last year
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆18May 17, 2025Updated 11 months ago
- KDD2024: This is the code for the paper "Propagation Structure-aware Graph Transformer for Robust and Interpretable Fake News Detection"☆12Aug 31, 2024Updated last year
- A curated list of awesome Unlearnable Example papers resources.☆13Dec 14, 2025Updated 4 months ago
- [USENIX Security '25] My ZIP isn’t your ZIP: Identifying and Exploiting Semantic Gaps Between ZIP Parsers☆38Mar 20, 2026Updated last month
- awesome papers in LLM interpretability☆618Aug 20, 2025Updated 8 months ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024☆12May 24, 2024Updated last year
- The public reproducible analysis code used for the gaze project☆10Feb 21, 2026Updated 2 months ago
- ☆30Jul 24, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆41Jan 8, 2026Updated 3 months ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- A reconstruction framework for materializing subjective experiences from brain signals☆14Jan 18, 2025Updated last year
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Oct 24, 2024Updated last year
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆32Apr 20, 2025Updated last year
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- ☆12Apr 19, 2022Updated 4 years ago
- Official Code Repository for paper "HYDRA: Model Factorization Framework for Black-Box LLM Personalization"☆16Oct 7, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]