Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆98Updated last year
Alternatives and similar repositories for awesome-language-model-analysis
Users who are interested in awesome-language-model-analysis are comparing it to the repositories listed below.
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆261Updated last week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆145Updated 5 months ago
- ☆204Updated 3 weeks ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆34Updated 10 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆150Updated 2 months ago
- ☆140Updated 10 months ago
- A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc.☆290Updated 3 weeks ago
- awesome SAE papers☆70Updated 7 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆93Updated last year
- 📜 Paper list on decoding methods for LLMs and LVLMs☆66Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆127Updated last year
- ☆72Updated 9 months ago
- ☆299Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆190Updated 8 months ago
- ☆68Updated 10 months ago
- ☆346Updated 5 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆73Updated 5 months ago
- A curated list of resources for activation engineering☆121Updated 3 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Updated 11 months ago
- ☆60Updated 6 months ago
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆34Updated 8 months ago
- ☆220Updated 9 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆96Updated 3 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆178Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Updated 9 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆68Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆63Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆271Updated last year
- The repo for In-context Autoencoder☆162Updated last year