SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
☆17Jun 24, 2024Updated last year
Alternatives and similar repositories for SPUQ
Users that are interested in SPUQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆36Dec 17, 2024Updated last year
- ☆105Jun 30, 2024Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated 2 years ago
- ☆27Apr 19, 2026Updated 2 weeks ago
- ☆79Apr 7, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python client library for Cleanlab Trustworthy Language Model☆24Dec 9, 2025Updated 5 months ago
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- ☆42Feb 2, 2024Updated 2 years ago
- [EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.☆21Sep 23, 2024Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- Benchmarking LLMs via Uncertainty Quantification☆262Jan 30, 2024Updated 2 years ago
- [NAACL 2024] Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers https://arxiv.org/abs/2307.…☆17Jan 27, 2024Updated 2 years ago
- ComfyUI custom nodes for Haiper AI API☆14Dec 6, 2024Updated last year
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆14Mar 11, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The code corresponding to Transfer Learning for a Foundational Chemistry Model☆13Dec 5, 2023Updated 2 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files.☆15Mar 13, 2024Updated 2 years ago
- ☆21Nov 26, 2024Updated last year
- ☆11Sep 25, 2025Updated 7 months ago
- JS/TS SDK for handling (extensible) events in Matrix☆10Jan 13, 2023Updated 3 years ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆18Apr 3, 2025Updated last year
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated last year
- Tutorial notebooks for SciFM24☆11Apr 2, 2024Updated 2 years ago
- ET-Tox☆12Oct 4, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆12Jun 20, 2016Updated 9 years ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Aug 4, 2023Updated 2 years ago
- [ICML'24] Conformal Prediction for Deep Classifier via Label Ranking☆14Jun 14, 2024Updated last year
- A proof-of-concept graph Database on top of FoundationDB☆11Mar 1, 2019Updated 7 years ago
- Super learning of conditional survival functions with right-censored time-to-event outcomes in discrete or continuous time.☆15Dec 9, 2024Updated last year
- Repository for my personal site https://nicklashansen.github.io/, built with plain html.☆15Apr 4, 2026Updated last month
- Mesos scheduling framework for Changes.☆17Nov 5, 2016Updated 9 years ago
- Code for "Transformer-Based Deep Survival Analysis"☆12May 27, 2022Updated 3 years ago
- PromptCraft is a prompt perturbation toolkit from the character, word, and sentence levels for prompt robustness analysis. PyPI Package: …☆23Jan 3, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Display and update sorted information about FoundationDB processes☆16Jul 5, 2018Updated 7 years ago
- ☆14Jul 24, 2024Updated last year
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning☆12Jul 2, 2024Updated last year
- fft library for dart.☆13Jan 10, 2026Updated 3 months ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt.☆12Jan 5, 2024Updated 2 years ago
- Practical AI/ML for Computational Biology and Chemistry Workshop☆20Jun 20, 2022Updated 3 years ago
- ☆19Jun 3, 2023Updated 2 years ago