socialfoundations / folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆20 Updated 2 weeks ago
Alternatives and similar repositories for folktexts:
Users who are interested in folktexts are comparing it to the libraries listed below:
- Achieve error-rate fairness between societal groups for any score-based classifier. ☆17 Updated 11 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks. ☆18 Updated 7 months ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆42 Updated last month
- Testing Language Models for Memorization of Tabular Datasets. ☆33 Updated 2 months ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior ☆12 Updated 2 years ago
- Extending Conformal Prediction to LLMs ☆66 Updated 10 months ago
- Fairness toolkit for PyTorch, scikit-learn, and AutoGluon ☆32 Updated 4 months ago
- ☆37 Updated 4 months ago
- ☆41 Updated last year
- In-context Example Selection with Influences ☆15 Updated last year
- ☆34 Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) ☆18 Updated 2 years ago
- ☆90 Updated 2 months ago
- Efficient multi-prompt evaluation of LLMs ☆19 Updated 4 months ago
- A Natural Language Interface to Explainable Boosting Machines ☆66 Updated 9 months ago
- Documenting large text datasets 🖼️ 📚 ☆12 Updated 4 months ago
- ☆23 Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆40 Updated 2 years ago
- Conformal Language Modeling ☆28 Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆12 Updated last year
- Official Repository for Dataset Inference for LLMs ☆33 Updated 8 months ago
- Repository of experiments in fairness Machine Learning. ☆9 Updated 10 months ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness" ☆12 Updated 3 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆53 Updated last year
- ☆43 Updated 5 months ago
- Data Benchmarking ☆19 Updated 10 months ago
- ☆48 Updated last month
- XAI-Bench is a library for benchmarking feature attribution explainability techniques ☆64 Updated 2 years ago
- ☆31 Updated 3 years ago
- Code for "Counterfactual Token Generation in Large Language Models", arXiv 2024. ☆25 Updated 5 months ago