☆105Jun 30, 2024Updated last year
Alternatives and similar repositories for UQ-NLG
Users that are interested in UQ-NLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆186Jun 20, 2024Updated last year
- SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models☆17Jun 24, 2024Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated 2 years ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆35Dec 17, 2024Updated last year
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).☆413Apr 12, 2024Updated 2 years ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- Locally Valid and Discriminative Prediction Intervals for Deep Learning Models☆13May 22, 2023Updated 3 years ago
- ☆47Dec 9, 2024Updated last year
- ☆42Feb 2, 2024Updated 2 years ago
- ☆25Jun 10, 2025Updated 11 months ago
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Nov 20, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆18Nov 23, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆59Jul 31, 2024Updated last year
- The implementation for ACL 2022 paper☆20Aug 14, 2022Updated 3 years ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆30Jan 14, 2023Updated 3 years ago
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 3 years ago
- ☆14Jul 17, 2025Updated 10 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆148Mar 14, 2024Updated 2 years ago
- This project collects methods that enhance the comparison between AMR graphs.☆11Jun 15, 2023Updated 2 years ago
- ☆21May 14, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆585Feb 12, 2024Updated 2 years ago
- "Oblique Decision Trees from Derivatives of ReLU Networks" (ICLR 2020, previously called "Locally Constant Networks")☆21Apr 27, 2021Updated 5 years ago
- An empirical investigation of deep learning theory☆16Oct 3, 2019Updated 6 years ago
- Functional Benchmarks and the Reasoning Gap☆90Oct 1, 2024Updated last year
- ☆191Mar 8, 2026Updated 2 months ago
- ☆13Apr 4, 2023Updated 3 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19May 14, 2026Updated 2 weeks ago
- ☆20Nov 3, 2024Updated last year
- Source code of "What Makes Graph Neural Networks Miscalibrated?" (NeurIPS 2022)☆23Jun 9, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Jun 4, 2024Updated last year
- ☆88May 22, 2026Updated last week
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- [ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models☆53Jul 24, 2024Updated last year
- ☆27Apr 19, 2026Updated last month
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?" [AAAI'26]☆21Dec 26, 2025Updated 5 months ago
- active learning + reusable workflows + likelihood free inference☆61Jun 10, 2017Updated 8 years ago