UCSB-NLP-Chang/llm_uncertainty

☆43

Alternatives and similar repositories for llm_uncertainty

Users who are interested in llm_uncertainty are comparing it to the repositories listed below.

lingchen0331 / UQ_ICL
Uncertainty quantification for in-context learning of large language models
☆15 · Apr 1, 2024 · Updated 2 years ago
AlexanderVNikitin / kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
☆36 · Dec 17, 2024 · Updated last year
lisa-wm / entropybaseduq
☆12 · Apr 4, 2025 · Updated last year
aidos-lab / magnipy
Metric Space Magnitude Computations
☆15 · Jun 30, 2026 · Updated 3 weeks ago
intuit-ai-research / SPUQ
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
☆17 · Jun 24, 2024 · Updated 2 years ago
MiaoXiong2320 / llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆148 · Mar 14, 2024 · Updated 2 years ago
myracheng / lm_caricature
code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
☆11 · Oct 13, 2023 · Updated 2 years ago
SeanLeng1 / Reward-Calibration
☆21 · Dec 14, 2024 · Updated last year
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆29 · Jun 4, 2024 · Updated 2 years ago
WANGXinyiLinda / LM_random_walk
Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
☆21 · Feb 29, 2024 · Updated 2 years ago
Ybakman / LLM_Uncertainty
☆12 · Sep 22, 2024 · Updated last year
myracheng / anthroscore
Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text
☆19 · Mar 31, 2025 · Updated last year
zlin7 / UQ-NLG
☆106 · Jun 30, 2024 · Updated 2 years ago
jzbjyb / lm-calibration
☆34 · Nov 17, 2021 · Updated 4 years ago
sauc-abadal / ALT
Official repository for ALT (ALignment with Textual feedback).
☆10 · Jul 25, 2024 · Updated 2 years ago
jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
☆832 · Jun 5, 2026 · Updated last month
nirgreshler / bayesian-online-planning
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13 · Jun 17, 2024 · Updated 2 years ago
jordemort / action-pyright
A GitHub Action to run pyright
☆12 · Dec 5, 2024 · Updated last year
IINemo / lm-polygraph
☆495 · May 18, 2026 · Updated 2 months ago
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 · Feb 23, 2024 · Updated 2 years ago
NExTplusplus / L2I
The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573
☆13 · Aug 2, 2022 · Updated 3 years ago
zhangxin-xd / Dataset-Pruning-TDDS
The official implementation of paper "Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning" (CVPR …
☆21 · Aug 20, 2024 · Updated last year
harris-chris / joint-shapley-values
Source code for the Joint Shapley values: a measure of joint feature importance
☆12 · Sep 14, 2021 · Updated 4 years ago
FightPandemics / FightPandemics-android
Android app for the FightPandemics platform
☆12 · Mar 20, 2021 · Updated 5 years ago
stellalisy / mediQ
☆43 · Jan 26, 2025 · Updated last year
Bowen1911 / Difficulty-Perception-of-LLMs
Code of paper: Probing the Difficulty Perception Mechanism of Large Language Models
☆18 · Mar 17, 2026 · Updated 4 months ago
AngelaZZZ-611 / reasoning_models_probing
☆22 · May 14, 2026 · Updated 2 months ago
yanshuotan / st5201x
Code and resources for NUS ST5201X
☆13 · Nov 5, 2022 · Updated 3 years ago
dataplayer12 / swish-activation
Repo for my blogs explaining swish activation function
☆13 · Dec 17, 2017 · Updated 8 years ago
smartyfh / LLM-Uncertainty-Bench
Benchmarking LLMs via Uncertainty Quantification
☆263 · Jan 30, 2024 · Updated 2 years ago
ZhihongShao / RECTIFY
Code and models for "Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework (ACL 2022)"
☆12 · Jun 29, 2022 · Updated 4 years ago
bobxwu / CFDTM
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion (ACL 2024 Findin…
☆16 · Aug 23, 2024 · Updated last year
ToolBeHonest / ToolBeHonest
[EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.
☆22 · Sep 23, 2024 · Updated last year
srzer / MOD
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆30 · Oct 30, 2024 · Updated last year
Jiuzhouh / Uncertainty-Aware-Language-Agent
This is the official repo for Towards Uncertainty-Aware Language Agent.
☆31 · Aug 15, 2024 · Updated last year
enrico310786 / video_classification
Train and test video classifier models with PyTorchVideo
☆15 · Nov 18, 2022 · Updated 3 years ago
Karim-53 / Compare-xAI
🧪 A unified benchmark to evaluate & compare Explainable AI methods (SHAP, LIME, ...) via functional tests. Live results + paper (arXiv:2…
☆14 · Jul 3, 2026 · Updated 3 weeks ago
LiangruXie / Calibration-Process-in-Black-Box-LLMs
☆21 · Nov 26, 2024 · Updated last year
dengyang17 / PACIFIC
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
☆14 · May 15, 2024 · Updated 2 years ago