socialfoundations / folktextsLinks
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
☆26Updated this week
Alternatives and similar repositories for folktexts
Users that are interested in folktexts are comparing it to the libraries listed below
Sorting:
- A collection of implementations of fair ML algorithms☆12Updated 7 years ago
- Fairness toolkit for pytorch, scikit learn and autogluon☆33Updated last month
- Extending Conformal Prediction to LLMs☆68Updated last year
- PAIR.withgoogle.com and friend's work on interpretability methods☆215Updated 2 weeks ago
- Efficient multi-prompt evaluation of LLMs☆24Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated last month
- PyTorch package to train and audit ML models for Individual Fairness☆66Updated 2 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆36Updated 10 months ago
- A Natural Language Interface to Explainable Boosting Machines☆68Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆133Updated last year
- ⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".☆11Updated 3 years ago
- Python client library for Cleanlab Trustworthy Language Model☆24Updated this week
- Data Benchmarking☆23Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆25Updated 3 years ago
- A lightweight implementation of removal-based explanations for ML models.☆59Updated 4 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆42Updated 3 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆21Updated 3 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Updated last year
- ☆79Updated last year
- Adversarial Attacks on Post Hoc Explanation Techniques (LIME/SHAP)☆85Updated 3 years ago
- The Prism Alignment Project☆86Updated last year
- Experimental library integrating LLM capabilities to support causal analyses☆265Updated 2 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆25Updated last year
- Python package to compute interaction indices that extend the Shapley Value. AISTATS 2023.☆18Updated 2 years ago
- Comparing fairness-aware machine learning techniques.☆160Updated 2 years ago
- Model Agnostic Counterfactual Explanations☆88Updated 3 years ago
- [ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502…☆65Updated last month
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark☆226Updated last year
- Documenting large text datasets 🖼️ 📚☆14Updated 11 months ago
- Finding semantically meaningful and accurate prompts.☆48Updated 2 years ago