Furyton / awesome-language-model-analysisLinks
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆93Updated 11 months ago
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
Sorting:
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆189Updated this week
- ☆181Updated 6 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆127Updated 3 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆31Updated 8 months ago
- ☆131Updated 8 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆120Updated 2 weeks ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆284Updated 7 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆83Updated 10 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆132Updated 7 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆121Updated last year
- A Sober Look at Language Model Reasoning☆87Updated last month
- A curated list of resources for activation engineering☆108Updated last month
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆138Updated 6 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆65Updated 3 months ago
- ☆55Updated 4 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆128Updated 4 months ago
- ☆282Updated 4 months ago
- awesome SAE papers☆57Updated 5 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆62Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆79Updated 2 years ago
- ☆215Updated 7 months ago
- ☆336Updated 3 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆61Updated 11 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆89Updated last month
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆117Updated 11 months ago
- A Survey on Data Selection for Language Models☆252Updated 6 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆84Updated 9 months ago
- ☆67Updated 7 months ago
- ☆212Updated 8 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆361Updated last month