sylinrl/CalibratedMath

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sylinrl/CalibratedMath)

sylinrl / CalibratedMath

Teaching Models to Express Their Uncertainty in Words

☆38

Alternatives and similar repositories for CalibratedMath

Users that are interested in CalibratedMath are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yinzhangyue / SelfAware
View on GitHub
Do Large Language Models Know What They Don’t Know?
☆103Nov 8, 2024Updated last year
csinva / mdl-complexity
View on GitHub
MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".
☆18Jun 12, 2023Updated 3 years ago
zlin7 / UQ-NLG
View on GitHub
☆106Jun 30, 2024Updated 2 years ago
LLaMafia / SFT_function_learning
View on GitHub
Explore what LLMs are really leanring over SFT
☆28Mar 30, 2024Updated 2 years ago
allenai / feb
View on GitHub
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12Apr 27, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
open-nlplab / fastchatgpt
View on GitHub
A python tool help to interact with chatgpt.
☆10Dec 11, 2022Updated 3 years ago
tatsu-lab / linguistic_calibration
View on GitHub
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆29Jun 4, 2024Updated 2 years ago
Jiuzhouh / Uncertainty-Aware-Language-Agent
View on GitHub
This is the official repo for Towards Uncertainty-Aware Language Agent.
☆31Aug 15, 2024Updated last year
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
Kaleidophon / awesome-experimental-standards-deep-learning
View on GitHub
Repository collecting resources and best practices to improve experimental rigour in deep learning research.
☆27Mar 30, 2023Updated 3 years ago
lifan-yuan / PLMCalibration
View on GitHub
Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"
☆11May 9, 2023Updated 3 years ago
CrowdTruth / Open-Domain-Relation-Extraction
View on GitHub
Crowdsourced data for open domain relation classification from sentences
☆20Oct 26, 2018Updated 7 years ago
facebookresearch / multiloko
View on GitHub
A benchmark with locally sourced multilingual questions for 31 languages.
☆18May 13, 2026Updated 2 months ago
Wenjun-Peng / GPT4SM
View on GitHub
☆11Jun 7, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
shuoli90 / Rank-Calibration
View on GitHub
This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.
☆14Apr 9, 2024Updated 2 years ago
shuyhere / about-super-alignment
View on GitHub
Feeling confused about super alignment? Here is a reading list
☆43Jan 9, 2024Updated 2 years ago
Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
lorenzkuhn / semantic_uncertainty
View on GitHub
☆186Jun 20, 2024Updated 2 years ago
Bolin97 / awesome-instruction-selector
View on GitHub
Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning
☆48Jan 22, 2026Updated 5 months ago
p-lambda / in-n-out
View on GitHub
Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"
☆13Oct 23, 2021Updated 4 years ago
allenai / sso
View on GitHub
Repository for Skill Set Optimization
☆14Jul 26, 2024Updated last year
Hsuan-Tung / universal_attack_natural_trigger
View on GitHub
Natural Universal Trigger Search (NUTS)
☆21Apr 17, 2021Updated 5 years ago
Nanami18 / Snowballed_Hallucination
View on GitHub
☆43Sep 3, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
icip-cas / SSO
View on GitHub
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…
☆20Nov 21, 2024Updated last year
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Updated this week
parameterlab / apricot
View on GitHub
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
☆22Nov 20, 2024Updated last year
kailas-v / human-ai-interactions
View on GitHub
☆11Oct 28, 2022Updated 3 years ago
atticusg / MultiplyQuantifiedData
View on GitHub
☆10Nov 1, 2019Updated 6 years ago
copenlu / awesome-text-interpretability
View on GitHub
A repo to keep all resources about interpretability in NLP organised and up to date
☆13Nov 22, 2020Updated 5 years ago
rdnfn / icai
View on GitHub
Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.
☆42May 6, 2026Updated 2 months ago
uiuctml / fair-classification
View on GitHub
Post-processing for fair classification
☆16Jun 30, 2025Updated last year
abhika-m / FAVA
View on GitHub
☆77Feb 16, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mdnunez / PIPS_course
View on GitHub
Programming in Psychological Science course. This repository contains materials for a R + Python intro course.
☆17May 3, 2023Updated 3 years ago
mymakar / causally_motivated_shortcut_removal
View on GitHub
☆14Jul 5, 2023Updated 3 years ago
lchen001 / HAPI
View on GitHub
☆16Nov 30, 2022Updated 3 years ago
feyzaakyurek / bbnli
View on GitHub
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Apr 28, 2022Updated 4 years ago
PremiLab-Math / MathCheck
View on GitHub
[ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
☆34Oct 23, 2024Updated last year
GAIR-NLP / alignment-for-honesty
View on GitHub
☆78May 22, 2024Updated 2 years ago
fdalvi / NeuroX
View on GitHub
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆108Oct 4, 2023Updated 2 years ago