cooperleong00 / Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
☆166Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-LLM-Interpretability
- A Survey on Data Selection for Language Models☆178Updated 3 weeks ago
- LLM hallucination paper list☆289Updated 7 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆96Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆81Updated last month
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆428Updated 6 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆133Updated 11 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆152Updated 9 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆101Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆94Updated 6 months ago
- A resource repository for representation engineering in large language models☆50Updated last month
- ☆31Updated 4 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆103Updated 3 weeks ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆73Updated this week
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆206Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆368Updated 3 weeks ago
- LLM Unlearning☆123Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆66Updated 3 weeks ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆156Updated 3 months ago
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆45Updated this week
- Function Vectors in Large Language Models (ICLR 2024)☆116Updated 3 weeks ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆75Updated 2 months ago
- ☆78Updated last year
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆51Updated this week
- ☆136Updated 4 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆202Updated this week
- Continual Learning of Large Language Models: A Comprehensive Survey☆239Updated last month
- ☆60Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆45Updated 7 months ago
- ☆106Updated 9 months ago