This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆100Jun 3, 2026Updated last week
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".☆10Jul 5, 2021Updated 4 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆47Oct 29, 2025Updated 7 months ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- a brief repo about paper research☆15Sep 4, 2024Updated last year
- ☆31Mar 17, 2026Updated 2 months ago
- DeepRAG: Thinking to Retrieve Step by Step for Large Language Models☆39Feb 17, 2026Updated 3 months ago
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆27Nov 3, 2024Updated last year
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆411Mar 2, 2025Updated last year
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆211Apr 13, 2026Updated 2 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- Codes for the paper The emergence of clusters in self-attention dynamics.☆18Dec 18, 2023Updated 2 years ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆63Mar 18, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [TOIS 2023] On the User Behavior Leakage from Recommender System Exposure☆19Nov 7, 2023Updated 2 years ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 3 years ago
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- ☆15Sep 21, 2022Updated 3 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆30Feb 6, 2026Updated 4 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆19Feb 20, 2025Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆48Oct 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆307Jan 22, 2026Updated 4 months ago
- Deep Learning Experiments Motivated from Fastai Course☆14Jan 2, 2019Updated 7 years ago
- ☆54May 20, 2024Updated 2 years ago
- ☆245May 10, 2024Updated 2 years ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆127Feb 3, 2026Updated 4 months ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated last year
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated last year
- ☆17Jun 14, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 処理の検証や比較検討での用途を想定したノードエディターベースの画像処理アプリ☆11Mar 5, 2023Updated 3 years ago
- ☆47Aug 26, 2025Updated 9 months ago
- メイカーの交流を円滑に進めるための<心がまえ>を明文化するプロジェクト☆10May 1, 2020Updated 6 years ago
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- A curated list of Survey Papers on Deep Learning.☆12Sep 5, 2023Updated 2 years ago
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆16Feb 27, 2025Updated last year