socialfoundations / folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆22 · Updated 2 months ago
Alternatives and similar repositories for folktexts
Users who are interested in folktexts are comparing it to the libraries listed below.
- Achieve error-rate fairness between societal groups for any score-based classifier. ☆18 · Updated last year
- Efficient multi-prompt evaluation of LLMs ☆19 · Updated 6 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks. ☆18 · Updated 9 months ago
- Testing Language Models for Memorization of Tabular Datasets. ☆33 · Updated 4 months ago
- A collection of implementations of fair ML algorithms ☆12 · Updated 7 years ago
- A Natural Language Interface to Explainable Boosting Machines ☆67 · Updated 11 months ago
- Extending Conformal Prediction to LLMs ☆66 · Updated last year
- Fairness toolkit for PyTorch, scikit-learn, and AutoGluon ☆32 · Updated 6 months ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆42 · Updated 3 months ago
- Code for "Counterfactual Token Generation in Large Language Models", arXiv 2024. ☆26 · Updated 7 months ago
- ☆31 · Updated 5 years ago
- Finding semantically meaningful and accurate prompts. ☆46 · Updated last year
- Explainable Artificial Intelligence through Contextual Importance and Utility ☆28 · Updated 10 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs. ☆119 · Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆41 · Updated 2 years ago
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main) ☆23 · Updated last month
- XAI-Bench is a library for benchmarking feature attribution explainability techniques ☆68 · Updated 2 years ago
- Data Benchmarking ☆21 · Updated last year
- In-context Example Selection with Influences ☆15 · Updated 2 years ago
- ☆48 · Updated 3 weeks ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆13 · Updated last year
- The Prism Alignment Project ☆77 · Updated last year
- Describing changes in LLM research trends in 2023. https://arxiv.org/abs/2307.10700 ☆18 · Updated last year
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals" ☆18 · Updated 2 years ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?" ☆18 · Updated 2 weeks ago
- ☆23 · Updated last year
- Learning clinical-decision rules with interpretable models. ☆20 · Updated last year
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior ☆12 · Updated 2 years ago
- [Experimental] Causal graphs that are networkx-compliant for the py-why ecosystem. ☆56 · Updated this week
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Updated 3 years ago