JShollaj / awesome-llm-interpretabilityView external linksLinks
A curated list of Large Language Model (LLM) Interpretability resources.
☆1,471Jun 22, 2025Updated 7 months ago
Alternatives and similar repositories for awesome-llm-interpretability
Users that are interested in awesome-llm-interpretability are comparing it to the libraries listed below
Sorting:
- This repository collects all relevant resources about interpretability in LLMs☆387Nov 1, 2024Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆295Jan 22, 2026Updated 3 weeks ago
- A library for mechanistic interpretability of GPT-style language models☆3,073Updated this week
- awesome papers in LLM interpretability☆609Aug 20, 2025Updated 5 months ago
- A unified evaluation framework for large language models☆2,775Jan 22, 2026Updated 3 weeks ago
- Robust recipes to align language models with human and AI preferences☆5,495Sep 8, 2025Updated 5 months ago
- Training Sparse Autoencoders on Language Models☆1,201Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,155Feb 8, 2026Updated last week
- 🔥Highlighting the top ML papers every week.☆12,231Jul 20, 2025Updated 6 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,407Apr 11, 2024Updated last year
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓☆3,538May 7, 2025Updated 9 months ago
- Sparse Autoencoder for Mechanistic Interpretability☆291Jul 20, 2024Updated last year
- Tools for merging pretrained large language models.☆6,783Jan 26, 2026Updated 2 weeks ago
- A framework for few-shot evaluation of language models.☆11,393Updated this week
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,325Apr 8, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- ☆230Nov 22, 2024Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,823Oct 28, 2025Updated 3 months ago
- ☆8,672Oct 9, 2024Updated last year
- ☆4,109Jun 4, 2024Updated last year
- The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et a…☆8,057Sep 12, 2025Updated 5 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆397Mar 2, 2025Updated 11 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,477Oct 31, 2023Updated 2 years ago
- Train transformer language models with reinforcement learning.☆17,360Updated this week
- All things prompt engineering☆5,733Jun 4, 2024Updated last year
- A resource repository for representation engineering in large language models☆148Nov 14, 2024Updated last year
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,822Jul 10, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,199Nov 3, 2025Updated 3 months ago
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆6,529Mar 19, 2025Updated 10 months ago
- Using sparse coding to find distributed representations used by neural networks.☆296Nov 10, 2023Updated 2 years ago
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. …☆1,247Dec 3, 2024Updated last year
- ☆2,551May 19, 2024Updated last year
- A library for advanced large language model reasoning☆2,330Jun 10, 2025Updated 8 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,852May 17, 2025Updated 8 months ago
- Awesome-LLM: a curated list of Large Language Model☆26,241Jul 31, 2025Updated 6 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆947Aug 14, 2024Updated last year
- Machine Learning Engineering Open Book☆16,675Updated this week
- Latest Advances on Multimodal Large Language Models☆17,337Feb 7, 2026Updated last week