AI4LIFE-GROUP / LLM_ExplainerLinks
Code for paper: Are Large Language Models Post Hoc Explainers?
☆34Updated last year
Alternatives and similar repositories for LLM_Explainer
Users that are interested in LLM_Explainer are comparing it to the libraries listed below
Sorting:
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆47Updated 2 years ago
- Conformal Language Modeling☆32Updated 2 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆84Updated last year
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆99Updated 11 months ago
- ☆41Updated last year
- ☆103Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆133Updated last year
- A repository for summaries of recent explainable AI/Interpretable ML approaches☆88Updated last year
- ☆158Updated 2 years ago
- FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.☆31Updated last year
- ☆183Updated last year
- Using Explanations as a Tool for Advanced LLMs☆69Updated last year
- ☆58Updated 2 years ago
- ☆33Updated last year
- ☆48Updated 11 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Updated 10 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆142Updated last year
- ☆33Updated last year
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆32Updated last year
- This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".☆24Updated last year
- Fairness in LLMs resources☆39Updated 2 weeks ago
- A Python Data Valuation Package☆31Updated 2 years ago
- A resource repository for representation engineering in large language models☆148Updated last year
- Uncertainty quantification for in-context learning of large language models☆16Updated last year
- A fast, effective data attribution method for neural networks in PyTorch☆227Updated last year
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Updated 2 years ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 5 years ago
- ☆13Updated 3 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Updated 2 years ago