AI4LIFE-GROUP / LLM_Explainer
Code for paper: Are Large Language Models Post Hoc Explainers?
☆31Updated 9 months ago
Alternatives and similar repositories for LLM_Explainer:
Users who are interested in LLM_Explainer are comparing it to the repositories listed below.
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆77Updated last year
- Conformal Language Modeling☆28Updated last year
- Uncertainty quantification for in-context learning of large language models☆16Updated last year
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156 ☆32Updated last year
- Using Explanations as a Tool for Advanced LLMs☆60Updated 7 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆64Updated 7 months ago
- A repository for summaries of recent explainable AI/Interpretable ML approaches☆74Updated 7 months ago
- A Python Data Valuation Package☆30Updated 2 years ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆18Updated last year
- FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.☆28Updated 11 months ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Updated last year
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆96Updated 3 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆18Updated last year
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆14Updated 4 months ago
- A simple PyTorch implementation of influence functions.☆85Updated 10 months ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 4 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆66Updated 2 years ago
- Official Repository for Dataset Inference for LLMs☆33Updated 9 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆75Updated 4 months ago
- Source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆41Updated 3 weeks ago
- The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction.☆21Updated 2 years ago