Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆75 Updated 3 months ago
Alternatives and similar repositories for awesome-language-model-analysis:
Users who are interested in awesome-language-model-analysis are comparing it to the repositories listed below:
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆70 Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆52 Updated 11 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation. ☆91 Updated 2 weeks ago
- ☆89 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆106 Updated 11 months ago
- A curated list of LLM Interpretability related material: Tutorial, Library, Survey, Paper, Blog, etc. ☆206 Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆106 Updated 5 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award) ☆87 Updated 5 months ago
- ☆78 Updated 2 months ago
- Code and Data Repo for [ICLR 2025] Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆30 Updated 2 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆106 Updated 6 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆128 Updated last week
- Function Vectors in Large Language Models (ICLR 2024) ☆142 Updated 5 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated last week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆51 Updated 3 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆99 Updated 4 months ago
- ☆78 Updated this week
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆54 Updated 11 months ago
- Collection of Reverse Engineering in Large Model ☆32 Updated 2 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆205 Updated 2 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆63 Updated 2 weeks ago
- A Survey on the Honesty of Large Language Models ☆54 Updated 3 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆118 Updated 6 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆95 Updated 3 months ago