Furyton / awesome-language-model-analysisView external linksLinks
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
☆98Dec 2, 2024Updated last year
Alternatives and similar repositories for awesome-language-model-analysis
Users that are interested in awesome-language-model-analysis are comparing it to the libraries listed below
Sorting:
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- [WSDM 2024 Best Paper Honorable Mention] Debiasing Sequential Recommenders through Distributionally Robust Optimization over System Expos…☆15Jun 20, 2024Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated 11 months ago
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆23Nov 3, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Semi-automated OpenVINO benchmark_app with variable parameters. User can specify multiple options for any parameters in the benchmark_app…☆10Apr 19, 2022Updated 3 years ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆56Feb 4, 2026Updated last week
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆205Dec 27, 2024Updated last year
- DeepRAG: Thinking to Retrieve Step by Step for Large Language Models☆32May 17, 2025Updated 8 months ago
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- DNN Node Collection using Inference Helper in ROS2☆13Apr 24, 2022Updated 3 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated 11 months ago
- albumentations test☆11Jun 23, 2020Updated 5 years ago
- ChatGPTをLINE botで触るハンズオン☆18Jun 28, 2023Updated 2 years ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated last week
- ☆53May 20, 2024Updated last year
- ☆35Feb 26, 2024Updated last year
- PyCon JP 2020 チュートリアルで利用する資料です☆10Aug 30, 2020Updated 5 years ago
- 「M1 MacにPythonの環境構築してみた」動画の資料です☆17Mar 24, 2021Updated 4 years ago
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL☆10Jul 20, 2018Updated 7 years ago
- Open Source Hardware "KOROBO (2-1 gen)"☆19Oct 18, 2025Updated 3 months ago
- Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation☆34May 8, 2024Updated last year
- ☆17Jun 14, 2024Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆21Jul 16, 2023Updated 2 years ago
- Data and code for understanding and generation of Kamon.☆31Updated this week
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆123Feb 3, 2026Updated last week
- Dockerfiles to use ROS with osrf/rocker☆18Dec 12, 2025Updated 2 months ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 9 months ago
- ☆17Feb 26, 2024Updated last year
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated 11 months ago
- ベクトルタイルを用いた3D風地図☆16Jul 3, 2020Updated 5 years ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- ☆21Dec 30, 2022Updated 3 years ago
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- NSFW画像検出モデル(open_nsfw_android)をColaboratory上で動かすサンプル☆17Dec 6, 2021Updated 4 years ago
- ROCKNIX FORK☆18Dec 6, 2025Updated 2 months ago