socialfoundations / folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆21Updated last month
Alternatives and similar repositories for folktexts
Users that are interested in folktexts are comparing it to the libraries listed below
Sorting:
- Achieve error-rate fairness between societal groups for any score-based classifier.☆17Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆42Updated 2 months ago
- Extending Conformal Prediction to LLMs☆66Updated 10 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated 3 months ago
- Learning clinical-decision rules with interpretable models.☆20Updated last year
- Efficient multi-prompt evaluation of LLMs☆19Updated 5 months ago
- Fairness toolkit for pytorch, scikit learn and autogluon☆32Updated 5 months ago
- A collection of implementations of fair ML algorithms☆12Updated 7 years ago
- In-context Example Selection with Influences☆15Updated 2 years ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆18Updated 8 months ago
- ☆35Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆41Updated 2 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 2 years ago
- Documenting large text datasets 🖼️ 📚☆12Updated 4 months ago
- ☆40Updated 5 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Data Benchmarking☆19Updated 11 months ago
- Official Repository for Dataset Inference for LLMs☆33Updated 9 months ago
- ☆48Updated last month
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆12Updated last year
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆18Updated last year
- Aioli: A unified optimization framework for language model data mixing☆25Updated 3 months ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆15Updated 8 months ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆25Updated 6 months ago
- The Prism Alignment Project☆75Updated last year
- A Natural Language Interface to Explainable Boosting Machines☆66Updated 10 months ago
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆66Updated 2 years ago
- ☆43Updated last year
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆17Updated 10 months ago