Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆71 Updated last month
Alternatives and similar repositories for awesome-language-model-analysis:
Users who are interested in awesome-language-model-analysis are comparing it to the repositories listed below:
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆197 Updated 3 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆69 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆103 Updated 9 months ago
- ☆86 Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆50 Updated 9 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆106 Updated 4 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning ☆141 Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award) ☆75 Updated 3 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆135 Updated 3 months ago
- A Survey on Data Selection for Language Models ☆203 Updated 3 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆26 Updated last week
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆53 Updated 9 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆96 Updated 11 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation. ☆88 Updated 2 weeks ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆149 Updated last month
- [NeurIPS 2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆48 Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆90 Updated 3 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆153 Updated 9 months ago
- A Survey on the Honesty of Large Language Models ☆51 Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆112 Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆100 Updated 4 months ago
- ☆85 Updated 4 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆161 Updated 11 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆33 Updated 3 months ago
- ☆48 Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆156 Updated 7 months ago
- [NeurIPS 2023] GitHub repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆59 Updated last year
- ☆45 Updated 3 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy ☆44 Updated last month
- LoFiT: Localized Fine-tuning on LLM Representations ☆30 Updated 2 weeks ago