Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆51 Updated last week
Related projects
Alternatives and complementary repositories for awesome-language-model-analysis
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆45 Updated 7 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆63 Updated last year
- ☆44 Updated 10 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆58 Updated 11 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆101 Updated last month
- ☆78 Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆69 Updated 8 months ago
- A resource repository for representation engineering in large language models ☆50 Updated last month
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆166 Updated 3 weeks ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View ☆27 Updated 3 weeks ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆50 Updated 6 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆62 Updated 2 years ago
- ☆33 Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆66 Updated 3 weeks ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆96 Updated 7 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning ☆83 Updated 5 months ago
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le… ☆68 Updated 7 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation. ☆73 Updated this week
- ☆26 Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬 ☆87 Updated 5 months ago
- Official Code for Paper: Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications ☆58 Updated last month
- ☆11 Updated 8 months ago
- ☆68 Updated 3 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆68 Updated 7 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆116 Updated 3 weeks ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆152 Updated 9 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆51 Updated this week
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆46 Updated last month
- ☆37 Updated last year
- A curated list of Model Merging methods. ☆82 Updated last month