socialfoundations/folktexts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/socialfoundations/folktexts)

socialfoundations / folktexts

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

☆29

Alternatives and similar repositories for folktexts

Users that are interested in folktexts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

feedzai / fairgbm
View on GitHub
Train Gradient Boosting models that are both high-performance *and* Fair!
☆109Jul 15, 2026Updated last week
felipemaiapolo / prompteval
View on GitHub
Efficient multi-prompt evaluation of LLMs
☆33Dec 6, 2024Updated last year
DingfanChen / Private-Set
View on GitHub
Official implementation of "Private Set Generation with Discriminative Information" (NeurIPS 2022)
☆18Aug 14, 2023Updated 2 years ago
twistedcubic / coin-press
View on GitHub
[NeurIPS 2020] Simple and practical private mean and covariance estimation.
☆35Oct 4, 2020Updated 5 years ago
jcperdomo / performative-prediction
View on GitHub
☆34Jan 13, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
mrtzh / Ladder.jl
View on GitHub
A reliable leaderboard algorithm for machine learning competitions
☆17May 19, 2015Updated 11 years ago
Philip-MIT / thread
View on GitHub
☆22Aug 18, 2024Updated last year
socialfoundations / tttlm
View on GitHub
Test-time-training on nearest neighbors for large language models
☆50Apr 18, 2024Updated 2 years ago
feedzai / bank-account-fraud
View on GitHub
Supporting documentation for the paper "Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation", and the Bank…
☆88Dec 21, 2023Updated 2 years ago
socialfoundations / causal-features
View on GitHub
Code to reproduce the paper "Do causal predictors generalize better to new domains?"
☆17Feb 7, 2025Updated last year
JuliaDecisionFocusedLearning / DifferentiableExpectations.jl
View on GitHub
A Julia package for differentiating through expectations with Monte-Carlo estimates
☆16Nov 25, 2024Updated last year
edchengg / VAE_GAN
View on GitHub
VAE+GAN
☆10Apr 18, 2018Updated 8 years ago
QuantEcon / continuous_time_mcs
View on GitHub
Continuous Time Markov Chains
☆11Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
garysieling / wikipedia-categorization
View on GitHub
☆16Feb 8, 2019Updated 7 years ago
white07S / ForexRL
View on GitHub
A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management
☆13Jan 11, 2023Updated 3 years ago
zwChan / Clinical-Text-Mining
View on GitHub
Clinical Text Mining
☆12Aug 15, 2017Updated 8 years ago
nestoralvaro / TwiMed
View on GitHub
TwiMed: Twitter and PubMed Comparable Corpus of Drugs, Diseases, Symptoms and their Relations
☆11May 24, 2017Updated 9 years ago
evalscience / deepgov-gg23
View on GitHub
☆12Jun 26, 2025Updated last year
max-muoto / monty-dspy-rlm
View on GitHub
Example for a Monty-enabled RLM in DSPy
☆20Feb 16, 2026Updated 5 months ago
MaayanLab / SEP-L1000
View on GitHub
Website for visualizing predicted drug side-effects using L1000 data (http://maayanlab.net/SEP-L1000/)
☆10Apr 15, 2022Updated 4 years ago
feast-dev / feast-gcp-fraud-tutorial
View on GitHub
Resources backing the Feast fraud tutorial on GCP
☆15May 31, 2022Updated 4 years ago
matpalm / collocations
View on GitHub
bigram / trigram analysis of wikipedia; mainly mutual info
☆22Mar 6, 2012Updated 14 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Sigfried / merki
View on GitHub
Medication Extraction and Reconciliation Knowledge Instrument
☆13Jun 17, 2026Updated last month
ningyu1991 / ScalableGANFingerprints
View on GitHub
The official TensorFlow implementation for ICLR'22 Spotlight paper 'Responsible Disclosure of Generative Models Using Scalable Fingerprin…
☆33Apr 16, 2023Updated 3 years ago
chl8856 / SurvivalQuilts
View on GitHub
SurvivalQuilts: Temporal Quilting for Survival Analysis
☆11Jan 9, 2024Updated 2 years ago
lab-smile / DOMINO
View on GitHub
☆11Nov 19, 2025Updated 8 months ago
edilsonacjr / semeval2017
View on GitHub
NILC-USP at SemEval-2017 Task 4: A Multi-view Ensemble for Twitter Sentiment Analysis
☆10Feb 19, 2017Updated 9 years ago
ewsheng / decoding-biases
View on GitHub
Scripts to evaluate various bias metrics for different NLG models + decoding algorithms
☆16Dec 6, 2023Updated 2 years ago
thu-ml / LM-Calibration
View on GitHub
☆17May 31, 2023Updated 3 years ago
kozodoi / DMC_2020
View on GitHub
Profit-driven demand forecasting with gradient boosted trees
☆11Mar 28, 2023Updated 3 years ago
FDA / VAERS-Annotations
View on GitHub
Vaccine Adverse Events Annoted
☆14Jun 8, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
baaesh / semeval19_task3
View on GitHub
Addressing Training-Test Class Distribution Mismatch in Conversational Classification for SemEval-2019 Task3 EmoContext
☆10Apr 9, 2019Updated 7 years ago
NeilPearson-Lilly / TractaViewer
View on GitHub
☆11Jan 20, 2020Updated 6 years ago
niderhoff / reddit-cnn
View on GitHub
Training a Convolutional Neural Network on reddit comments to predict upvotes
☆16Jan 30, 2017Updated 9 years ago
Xingrun-Xing2 / EfficientLLM
View on GitHub
A family of efficient edge language models in 100M~1B sizes.
☆19Feb 14, 2025Updated last year
SagiLevanon1 / scmp
View on GitHub
☆10Jun 13, 2021Updated 5 years ago
jasonost / clinicaltrials
View on GitHub
☆13Sep 10, 2015Updated 10 years ago
wiebket / bt4vt
View on GitHub
Bias Tests for Voice Technologies (bt4vt)
☆11Jun 16, 2024Updated 2 years ago