AmourWaltz/Awesome-Reliable-LLM

AmourWaltz / Awesome-Reliable-LLM

☆193

Alternatives and similar repositories for Awesome-Reliable-LLM

Users who are interested in Awesome-Reliable-LLM are comparing it to the repositories listed below.

D2I-ai / eigenscore
☆46 • Dec 9, 2024 • Updated last year
SihengLi99 / LLM-Honesty-Survey
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66 • Dec 8, 2024 • Updated last year
AmourWaltz / UAlign
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15 • Mar 25, 2025 • Updated last year
satrams / rent-rl
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
☆42 • Oct 31, 2025 • Updated 8 months ago
technion-cs-nlp / hallucination-mitigation
☆23 • Dec 17, 2024 • Updated last year
balevinstein / Probes
☆58 • Jun 30, 2023 • Updated 3 years ago
MaHuanAAA / logtoku
☆42 • Aug 21, 2025 • Updated 11 months ago
zlin7 / UQ-NLG
☆106 • Jun 30, 2024 • Updated 2 years ago
HanNight / AdaCAD
Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"
☆16 • Mar 2, 2026 • Updated 4 months ago
aryamanarora / bayesian-laws-icl
Bayesian scaling laws for in-context learning.
☆16 • Mar 12, 2025 • Updated last year
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13 • Nov 17, 2024 • Updated last year
sciai-lab / Truth_is_Universal
☆34 • Nov 7, 2024 • Updated last year
tor4z / awesome-confidence-calibration
awesome confidence calibration paper list
☆25 • Oct 21, 2021 • Updated 4 years ago
jlko / semantic_uncertainty
Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).
☆421 • Apr 12, 2024 • Updated 2 years ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆159 • Sep 21, 2024 • Updated last year
leopoldwhite / Awesome-Inference-Time-Trustworthiness
☆15 • May 15, 2026 • Updated 2 months ago
zjunlp / KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
☆1,242 • Jun 25, 2026 • Updated last month
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆54 • Jun 6, 2025 • Updated last year
Yixiao-Song / VeriScore
☆39 • Dec 17, 2025 • Updated 7 months ago
claws-lab / XLingEval
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"
☆19 • May 14, 2026 • Updated 2 months ago
yangheng95 / InstOptima
This repo is for our EMNLP2023 short paper (Findings): InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Langua…
☆14 • Jan 11, 2024 • Updated 2 years ago
snowood1 / BERT-ENN
Uncertainty-Aware Reliable Text Classification (KDD 2021)
☆18 • Oct 4, 2022 • Updated 3 years ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆59 • May 28, 2025 • Updated last year
jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
☆833 • Jun 5, 2026 • Updated last month
RonMcKay / UQGAN
UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs
☆10 • Apr 13, 2023 • Updated 3 years ago
rxlqn / awesome-llm-self-reflection
augmented LLM with self reflection
☆144 • Nov 21, 2023 • Updated 2 years ago
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆86 • Feb 5, 2024 • Updated 2 years ago
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆63 • Sep 4, 2024 • Updated last year
starrYYxuan / LeCo
This the implementation of LeCo
☆33 • Jan 20, 2025 • Updated last year
deeplearning-wisc / haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
☆70 • Apr 11, 2025 • Updated last year
cognizant-ai-labs / semantic-density-paper
This repo contains the source code for reproducing the experimental results in semantic density paper (Neurips 2024)
☆21 • Sep 28, 2025 • Updated 9 months ago
597358816 / AEPO
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
☆17 • Jan 19, 2026 • Updated 6 months ago
JiahaoChen1 / Calibration
☆15 • Mar 20, 2023 • Updated 3 years ago
yhao-wang / LLM-Knowledge-Boundary
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆21 • Jul 31, 2023 • Updated 2 years ago
EricLee8 / Multi-party-Dialogue-MRC
Codes and data for EMNLP 2021 paper "Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Re…
☆16 • Oct 15, 2022 • Updated 3 years ago
HarlynDN / WebCiteS
[ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
☆13 • Sep 11, 2024 • Updated last year
Tencent / WebAggregator
[ACL 2026 Main Conference] WebAggregator
☆69 • Oct 18, 2025 • Updated 9 months ago
armingh2000 / FactScoreLite
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14 • Apr 25, 2024 • Updated 2 years ago
qinlibo-hit / Retriever-Dialogue
end-to-end dialog system dataset
☆13 • Sep 15, 2019 • Updated 6 years ago