socialfoundations / folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆20Updated last month
Alternatives and similar repositories for folktexts:
Users that are interested in folktexts are comparing it to the libraries listed below
- Achieve error-rate fairness between societal groups for any score-based classifier.☆16Updated 11 months ago
- Data Benchmarking☆19Updated 10 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆18Updated 6 months ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆41Updated 3 weeks ago
- Fairness toolkit for pytorch, scikit learn and autogluon☆31Updated 3 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated last month
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆113Updated 9 months ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- ☆22Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆12Updated 3 years ago
- Extending Conformal Prediction to LLMs☆64Updated 9 months ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆40Updated 2 years ago
- A collection of implementations of fair ML algorithms☆12Updated 7 years ago
- ☆89Updated last month
- Conformal Language Modeling☆28Updated last year
- Experimental library integrating LLM capabilities to support causal analyses☆120Updated last week
- ☆34Updated last year
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 2 years ago
- ☆38Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024)☆96Updated 11 months ago
- In-context Example Selection with Influences☆15Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- The Prism Alignment Project☆69Updated 11 months ago
- Documenting large text datasets 🖼️ 📚☆11Updated 3 months ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Updated 2 years ago
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆63Updated 2 years ago
- ☆23Updated last year
- Causal Agent based on Large Language Model☆42Updated 7 months ago
- A Natural Language Interface to Explainable Boosting Machines☆65Updated 8 months ago