Furyton / awesome-language-model-analysisLinks
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆92Updated 10 months ago
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
Sorting:
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆177Updated last week
- ☆174Updated 5 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆118Updated 2 months ago
- ☆129Updated 7 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆273Updated 7 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆31Updated 7 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆113Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆121Updated last year
- awesome SAE papers☆51Updated 5 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆125Updated 3 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆80Updated 10 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆89Updated last week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆130Updated 7 months ago
- A curated list of resources for activation engineering☆107Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆247Updated 3 weeks ago
- ☆334Updated 2 months ago
- 📜 Paper list on decoding methods for LLMs and LVLMs☆61Updated 3 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆101Updated last month
- Function Vectors in Large Language Models (ICLR 2024)☆181Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆188Updated 9 months ago
- ☆275Updated 3 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆125Updated 5 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆79Updated 2 years ago
- ☆54Updated 3 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆61Updated last year
- Collection of Reverse Engineering in Large Model☆34Updated 9 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆64Updated 3 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆259Updated last year
- A Sober Look at Language Model Reasoning☆85Updated 2 weeks ago
- A Survey on Data Selection for Language Models☆250Updated 5 months ago