JShollaj / awesome-llm-interpretabilityLinks
A curated list of Large Language Model (LLM) Interpretability resources.
☆1,356Updated 6 months ago
Alternatives and similar repositories for awesome-llm-interpretability
Users that are interested in awesome-llm-interpretability are comparing it to the libraries listed below
Sorting:
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. …☆822Updated 6 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,387Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,166Updated last year
- List of papers on hallucination detection in LLMs.☆896Updated last week
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,490Updated 4 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆836Updated 10 months ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,306Updated 2 weeks ago
- System 2 Reasoning Link Collection☆838Updated 3 months ago
- Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models☆759Updated last month
- This repository collects all relevant resources about interpretability in LLMs☆358Updated 7 months ago
- A library for advanced large language model reasoning☆2,148Updated last week
- A unified evaluation framework for large language models☆2,641Updated 3 weeks ago
- All the projects related to Llama☆380Updated 2 months ago
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.☆771Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,897Updated 10 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆857Updated 2 weeks ago
- Robust recipes to align language models with human and AI preferences☆5,232Updated last month
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,098Updated 3 weeks ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'☆1,541Updated 4 months ago
- A bibliography and survey of the papers surrounding o1☆1,199Updated 7 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,544Updated last year
- Best practices for distilling large language models.☆553Updated last year
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,773Updated 5 months ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆756Updated 3 weeks ago
- Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models☆1,231Updated 3 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆982Updated 11 months ago
- Curated list of datasets and tools for post-training.☆3,158Updated 4 months ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,570Updated 7 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,434Updated 5 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆952Updated last month