AI4LIFE-GROUP / LLM_ExplainerLinks
Code for paper: Are Large Language Models Post Hoc Explainers?
☆34Updated last year
Alternatives and similar repositories for LLM_Explainer
Users that are interested in LLM_Explainer are comparing it to the libraries listed below
Sorting:
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆99Updated 8 months ago
- A repository for summaries of recent explainable AI/Interpretable ML approaches☆84Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆83Updated last year
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆39Updated last year
- Conformal Language Modeling☆32Updated last year
- ☆100Updated last year
- ☆38Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆76Updated last year
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆133Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆130Updated 11 months ago
- Using Explanations as a Tool for Advanced LLMs☆67Updated last year
- ☆32Updated last year
- ☆151Updated 2 years ago
- ☆56Updated 2 years ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 4 years ago
- A fast, effective data attribution method for neural networks in PyTorch☆218Updated 10 months ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆128Updated 7 months ago
- ☆32Updated 10 months ago
- FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.☆30Updated last year
- A resource repository for representation engineering in large language models☆138Updated 10 months ago
- A curated list of papers and resources about the distribution shift in machine learning.☆123Updated 2 years ago
- ☆178Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024)☆111Updated last year
- ☆46Updated 8 months ago
- A Python Data Valuation Package☆30Updated 2 years ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆128Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆81Updated 7 months ago
- A simple PyTorch implementation of influence functions.☆91Updated last year
- ☆13Updated 2 years ago
- The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction.☆22Updated 2 years ago