rattlesnakey / Awesome-Actionable-MI-Survey
The GitHub repo for our survey paper: "Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models"
☆69 · Updated this week
Alternatives and similar repositories for Awesome-Actionable-MI-Survey
Users who are interested in Awesome-Actionable-MI-Survey are comparing it to the repositories listed below.
- Curated list of awesome LMM hallucinations papers, methods & resources. ☆150 · Updated last year
- ☆69 · Updated 10 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆93 · Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆46 · Updated last year
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆89 · Updated 11 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg… ☆180 · Updated 3 months ago
- ☆36 · Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆85 · Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models