Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆80Updated 4 months ago
Alternatives and similar repositories for awesome-language-model-analysis:
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆71Updated 2 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆108Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆75Updated 2 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆72Updated this week
- The Paper List on Data Contamination for Large Language Models Evaluation.☆92Updated 3 weeks ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆223Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆162Updated last year
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆41Updated 4 months ago
- A curated list of resources for activation engineering☆65Updated 2 weeks ago
- ☆93Updated last year
- Function Vectors in Large Language Models (ICLR 2024)☆161Updated last week
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated last month
- A brief and partial summary of RLHF algorithms.☆127Updated last month
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆35Updated 3 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆99Updated 6 months ago
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆17Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆115Updated last month
- ☆90Updated 3 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 4 months ago
- ☆93Updated last month
- Collection of Reverse Engineering in Large Model☆32Updated 3 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆60Updated last year
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆133Updated last month
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆122Updated 9 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆42Updated 3 weeks ago
- ☆54Updated last week
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆175Updated last month
- ☆60Updated this week