socialfoundations / folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆19Updated last week
Alternatives and similar repositories for folktexts:
Users that are interested in folktexts are comparing it to the libraries listed below
- Achieve error-rate fairness between societal groups for any score-based classifier.☆16Updated 9 months ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆39Updated 10 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated last week
- Extending Conformal Prediction to LLMs☆63Updated 8 months ago
- Experimental library integrating LLM capabilities to support causal analyses☆106Updated 5 months ago
- ☆36Updated last year
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆62Updated 2 years ago
- A Natural Language Interface to Explainable Boosting Machines☆64Updated 7 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆93Updated 10 months ago
- Conformal Language Modeling☆28Updated last year
- ☆31Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆109Updated 8 months ago
- A collection of implementations of fair ML algorithms☆12Updated 7 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆24Updated 3 months ago
- ☆65Updated 10 months ago
- Code to reproduce our paper on probabilistic algorithmic recourse: https://arxiv.org/abs/2006.06831☆36Updated 2 years ago
- ☆81Updated last week
- Code for paper: Are Large Language Models Post Hoc Explainers?☆30Updated 6 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Efficient multi-prompt evaluation of LLMs☆19Updated 2 months ago
- Learning clinical-decision rules with interpretable models.☆20Updated last year
- ☆43Updated 9 months ago
- ☆30Updated 3 years ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆27Updated 6 months ago
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆18Updated 10 months ago
- Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vi…☆62Updated 2 years ago
- PyTorch package to train and audit ML models for Individual Fairness☆64Updated last year
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆17Updated 5 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated last year