☆106Jun 30, 2024Updated last year
Alternatives and similar repositories for UQ-NLG
Users that are interested in UQ-NLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆185Jun 20, 2024Updated last year
- SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models☆17Jun 24, 2024Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated 2 years ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆62Sep 4, 2024Updated last year
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆36Dec 17, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 3 years ago
- ☆471Updated this week
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also …☆12Mar 29, 2026Updated last month
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- Locally Valid and Discriminative Prediction Intervals for Deep Learning Models☆13May 22, 2023Updated 2 years ago
- ☆46Dec 9, 2024Updated last year
- ☆42Feb 2, 2024Updated 2 years ago
- Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models☆818Apr 23, 2026Updated 2 weeks ago
- Likelihood-Free Frequentist Inference☆21May 2, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Nov 20, 2024Updated last year
- Codebase for information theoretic shapley values to explain predictive uncertainty.This repo contains the code related to the paperWatso…☆22Jul 4, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- ☆59Jul 31, 2024Updated last year
- This repo contains the source code for reproducing the experimental results in semantic density paper (Neurips 2024)☆19Sep 28, 2025Updated 7 months ago
- The implementation for ACL 2022 paper☆20Aug 14, 2022Updated 3 years ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Jan 14, 2023Updated 3 years ago
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆14Jul 17, 2025Updated 9 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆149Mar 14, 2024Updated 2 years ago
- This project collects methods that enhance the comparison between AMR graphs.☆11Jun 15, 2023Updated 2 years ago
- ☆20May 3, 2025Updated last year
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 4 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆578Feb 12, 2024Updated 2 years ago
- "Oblique Decision Trees from Derivatives of ReLU Networks" (ICLR 2020, previously called "Locally Constant Networks")☆21Apr 27, 2021Updated 5 years ago
- Functional Benchmarks and the Reasoning Gap☆90Oct 1, 2024Updated last year
- ☆191Mar 8, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Apr 4, 2023Updated 3 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19Apr 1, 2024Updated 2 years ago
- ☆20Nov 3, 2024Updated last year
- Source code of "What Makes Graph Neural Networks Miscalibrated?" (NeurIPS 2022)☆23Jun 9, 2025Updated 11 months ago
- ☆16Oct 3, 2023Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Jun 4, 2024Updated last year
- Random Pluto notebooks in Julia☆12Oct 23, 2025Updated 6 months ago