hy-zhao23 / Explainability-for-Large-Language-ModelsLinks

☆154

Alternatives and similar repositories for Explainability-for-Large-Language-Models

Users that are interested in Explainability-for-Large-Language-Models are comparing it to the libraries listed below

Sorting:

LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆323Updated last year
xiaoya-li / Instruction-Tuning-Survey
Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`
☆188Updated 2 months ago
HITsz-TMG / awesome-llm-attributions
A Survey of Attributions for Large Language Models
☆216Updated last year
alon-albalak / data-selection-survey
A Survey on Data Selection for Language Models
☆250Updated 5 months ago
cooperleong00 / Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
☆273Updated 6 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆138Updated last year
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆175Updated last year
zjunlp / KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆158Updated 7 months ago
jlko / semantic_uncertainty
Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).
☆371Updated last year
Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…
☆80Updated 2 years ago
JacksonWuxs / UsableXAI_LLM
Using Explanations as a Tool for Advanced LLMs
☆67Updated last year
hyintell / awesome-refreshing-llms
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
☆135Updated last year
wangcunxiang / LLM-Factuality-Survey
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
☆340Updated last year
SuperBruceJia / Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
☆109Updated 2 months ago
MiuLab / PersonaLLM-Survey
☆100Updated last year
HowieHwong / DataGen
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆64Updated 7 months ago
AGI-Edgerunners / LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
☆145Updated last year
lyy1994 / awesome-data-contamination
The Paper List on Data Contamination for Large Language Models Evaluation.
☆100Updated last month
Glaciohound / LM-Steer
Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)
☆124Updated 2 months ago
WangRongsheng / Awesome-LLM-with-RAG
A curated list of Large Language Model with RAG
☆81Updated last year
chrisliu298 / awesome-representation-engineering
A resource repository for representation engineering in large language models
☆136Updated 10 months ago
zorazrw / awesome-tool-llm
☆239Updated last year
swj0419 / detect-pretrain-code
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆233Updated last year
ZFancy / awesome-activation-engineering
A curated list of resources for activation engineering
☆105Updated last week
lorenzkuhn / semantic_uncertainty
☆178Updated last year
junchenzhi / Awesome-LLM-Ensemble
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
☆137Updated this week
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆61Updated last year
hongbinye / Cognitive-Mirage-Hallucinations-in-LLMs
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
☆47Updated last year
weizhepei / InstructRAG
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆124Updated 8 months ago
jlko / long_hallucinations
Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).
☆70Updated last year