Furyton / awesome-language-model-analysisLinks
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆84Updated 6 months ago
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
Sorting:
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆96Updated last week
- A curated list of resources for activation engineering☆83Updated last week
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆72Updated 2 years ago
- awesome SAE papers☆33Updated last week
- ☆131Updated 2 weeks ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆53Updated 2 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆243Updated 2 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆47Updated 2 weeks ago
- The official code repository for PRMBench.☆73Updated 3 months ago
- ☆57Updated this week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆84Updated 3 months ago
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆21Updated 3 weeks ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- ☆64Updated last month
- A Sober Look at Language Model Reasoning☆52Updated last week
- ☆105Updated 2 months ago
- ☆38Updated 3 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆19Updated last week
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆145Updated 2 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆95Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆123Updated 2 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated 2 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆79Updated 3 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆106Updated 5 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆164Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆218Updated 2 weeks ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆110Updated last year
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆69Updated 3 months ago