Furyton / awesome-language-model-analysisLinks
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆98Updated last year
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
Sorting:
- ☆197Updated this week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆242Updated this week
- FeatureAlignment = Alignment + Mechanistic Interpretability☆33Updated 9 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆288Updated this week
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆142Updated last month
- ☆346Updated 4 months ago
- ☆136Updated 9 months ago
- ☆294Updated 5 months ago
- awesome SAE papers☆69Updated 7 months ago
- ☆72Updated 8 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆126Updated last year
- A Sober Look at Language Model Reasoning☆92Updated last month
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆157Updated 7 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆140Updated 4 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆92Updated last year
- ☆218Updated 9 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆173Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆71Updated 8 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆381Updated 2 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆317Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆270Updated last year
- A Survey on Data Selection for Language Models☆253Updated 7 months ago
- 📜 Paper list on decoding methods for LLMs and LVLMs☆68Updated last month
- A curated list of resources for activation engineering☆119Updated 2 months ago
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆34Updated 7 months ago
- ☆213Updated 10 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆133Updated 5 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆80Updated 2 years ago
- A brief and partial summary of RLHF algorithms.☆139Updated 9 months ago