JShollaj / awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
☆1,136 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for awesome-llm-interpretability
- ReFT: Representation Finetuning for Language Models ☆1,145 · Updated this week
- A unified evaluation framework for large language models ☆2,447 · Updated last week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,333 · Updated 6 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆792 · Updated 2 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency ☆718 · Updated 2 months ago
- High-quality datasets, tools, and concepts for LLM fine-tuning. ☆1,965 · Updated 2 weeks ago
- List of papers on hallucination detection in LLMs. ☆666 · Updated last week
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,034 · Updated 6 months ago
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. … ☆766 · Updated 3 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,026 · Updated last week
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. ☆1,797 · Updated last week
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions ☆633 · Updated this week
- Training LLMs with QLoRA + FSDP ☆1,419 · Updated this week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,504 · Updated 3 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆787 · Updated last week
- This repository collects all relevant resources about interpretability in LLMs ☆280 · Updated last week
- A library for advanced large language model reasoning ☆1,418 · Updated 2 months ago
- System 2 Reasoning Link Collection ☆683 · Updated last week
- Automatically evaluate your LLMs in Google Colab ☆556 · Updated 6 months ago
- All the projects related to Llama ☆369 · Updated 2 weeks ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆894 · Updated this week
- ☆1,878 · Updated last week
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ... ☆1,557 · Updated last month
- ☆1,263 · Updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars ☆960 · Updated 3 months ago
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆761 · Updated 2 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆738 · Updated last week
- ☆889 · Updated 3 weeks ago
- ☆903 · Updated this week