nayeon7lee/FactualityPrompt

nayeon7lee / FactualityPrompt

☆90

Alternatives and similar repositories for FactualityPrompt

Users who are interested in FactualityPrompt are comparing it to the libraries listed below.

nayeon7lee / factuality_enhanced_lm_hf
☆13 · Nov 11, 2022 · Updated 3 years ago
google / BEGIN-dataset
A benchmark dataset for evaluating dialog system and natural language generation metrics.
☆39 · Jun 13, 2022 · Updated 4 years ago
sylinrl / TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
☆935 · Jan 16, 2025 · Updated last year
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆450 · Apr 13, 2025 · Updated last year
dunzeng / MORE
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16 · Aug 6, 2024 · Updated last year
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592 · Feb 12, 2024 · Updated 2 years ago
McGill-NLP / FaithDial
☆51 · Feb 5, 2023 · Updated 3 years ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆557 · Jul 12, 2026 · Updated last week
Nanami18 / Snowballed_Hallucination
☆43 · Sep 3, 2024 · Updated last year
AI21Labs / factor
Code and data for the FACTOR paper
☆54 · Nov 15, 2023 · Updated 2 years ago
XiangLi1999 / ContrastiveDecoding
contrastive decoding
☆206 · Nov 14, 2022 · Updated 3 years ago
derenlei / FactCG
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)
☆17 · Jul 14, 2025 · Updated last year
orhonovich / q-squared
☆30 · Sep 5, 2021 · Updated 4 years ago
RUCAIBox / FIGA
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10 · May 5, 2024 · Updated 2 years ago
HCY123902 / atg-w-fg-rw
☆10 · May 27, 2024 · Updated 2 years ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆97 · Jul 25, 2023 · Updated 2 years ago
armingh2000 / FactScoreLite
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14 · Apr 25, 2024 · Updated 2 years ago
allenai / FineGrainedRLHF
☆283 · Jan 6, 2025 · Updated last year
CogComp / Salient-Event-Detection
The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"
☆10 · Jul 5, 2022 · Updated 4 years ago
HKUST-KnowComp / Knowledge-Constrained-Decoding
Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…
☆30 · Nov 14, 2023 · Updated 2 years ago
sustcsonglin / disco-pointer
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14 · Aug 25, 2023 · Updated 2 years ago
qipeng / arc-swift
Implementation of "Arc-swift: A Novel Transition System for Dependency Parsing"
☆32 · Aug 21, 2018 · Updated 7 years ago
likenneth / honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆581 · Jan 28, 2025 · Updated last year
yinzhangyue / SelfAware
Do Large Language Models Know What They Don’t Know?
☆103 · Nov 8, 2024 · Updated last year
thu-coai / DiaSafety
This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark
☆25 · Aug 13, 2022 · Updated 3 years ago
zjunlp / FactCHD
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
☆90 · Apr 28, 2024 · Updated 2 years ago
hkust-nlp / felm
GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65 · Dec 25, 2023 · Updated 2 years ago
princeton-nlp / EntityQuestions
EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535
☆148 · Feb 21, 2022 · Updated 4 years ago
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,852 · Jun 17, 2025 · Updated last year
emorynlp / seq2seq-corenlp
☆13 · Feb 7, 2023 · Updated 3 years ago
HillZhang1999 / llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085 · Sep 27, 2025 · Updated 9 months ago
dhgottesman / keen_estimating_knowledge_in_llms
☆18 · Nov 5, 2025 · Updated 8 months ago
sunlab-osu / Understanding-CoT
☆88 · Jun 1, 2023 · Updated 3 years ago
eric-mitchell / mend
MEND: Fast Model Editing at Scale
☆259 · Aug 30, 2023 · Updated 2 years ago
QwenLM / ProcessBench
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆190 · May 20, 2025 · Updated last year
xieyxclack / factual_coco
The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.
☆17 · Nov 11, 2021 · Updated 4 years ago
jungokasai / twist_decoding
☆30 · May 20, 2022 · Updated 4 years ago
FUZHIYI / TACO
Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"
☆33 · Oct 23, 2022 · Updated 3 years ago
YuxiXie / MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331 · Jan 29, 2026 · Updated 5 months ago