google-research/true

Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".

☆92

Alternatives and similar repositories for true

Users who are interested in true are comparing it to the repositories listed below.

orhonovich / q-squared
☆30 · Sep 5, 2021 · Updated 4 years ago
Liyan06 / AggreFact
Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)
☆28 · Mar 26, 2024 · Updated 2 years ago
tanyuqian / ctc-gen-eval
EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation
☆97 · Mar 20, 2023 · Updated 3 years ago
salesforce / factCC
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
☆305 · May 1, 2025 · Updated last year
google-research-datasets / AIS
AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…
☆31 · Jan 14, 2023 · Updated 3 years ago
Alex-Fabbri / AnswerSumm
☆10 · Jul 18, 2022 · Updated 4 years ago
myt517 / DKT
Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…
☆14 · Jul 23, 2022 · Updated 4 years ago
yuh-zha / AlignScore
ACL2023 - AlignScore, a metric for factual consistency evaluation.
☆164 · Mar 11, 2024 · Updated 2 years ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆97 · Jul 25, 2023 · Updated 3 years ago
Huffon / factsumm
FactSumm: Factual Consistency Scorer for Abstractive Summarization
☆113 · Jan 1, 2024 · Updated 2 years ago
kenchan0226 / FineGrainedFact
Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…
☆15 · Jan 25, 2024 · Updated 2 years ago
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆450 · Apr 13, 2025 · Updated last year
google / BEGIN-dataset
A benchmark dataset for evaluating dialog system and natural language generation metrics.
☆39 · Jun 13, 2022 · Updated 4 years ago
TalSchuster / VitaminC
Contrastive Fact Verification
☆74 · Sep 17, 2022 · Updated 3 years ago
ddehun / DEnsity
Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"
☆11 · May 23, 2023 · Updated 3 years ago
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆60 · Jun 2, 2026 · Updated last month
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65 · Dec 25, 2023 · Updated 2 years ago
PrimerAI / blanc
Human-free quality estimation of document summaries
☆97 · Dec 1, 2025 · Updated 7 months ago
HanNight / AMuLaP
Code for NAACL 2022 paper "Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification"
☆25 · Oct 13, 2022 · Updated 3 years ago
iitmnlp / Dialogue-Evaluation-with-BERT
☆31 · Jan 16, 2021 · Updated 5 years ago
zhichaoxu-shufe / context-aware-decoding-qfs
☆14 · Jan 10, 2024 · Updated 2 years ago
jiho283 / FactKG
Official repository of FactKG
☆67 · Apr 22, 2025 · Updated last year
princeton-nlp / ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆522 · Oct 9, 2024 · Updated last year
amazon-science / dstc11-track2-intent-induction
DSTC 11 Track 2: Intent Induction from Conversations for Task-Oriented Dialogue
☆49 · May 5, 2023 · Updated 3 years ago
azinmatin / prince
☆11 · Mar 25, 2022 · Updated 4 years ago
thu-coai / CTRLEval
Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)
☆33 · Jun 6, 2022 · Updated 4 years ago
guijinSON / MM-Eval
Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"
☆20 · Oct 26, 2024 · Updated last year
ryokamoi / wice
This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.
☆43 · Dec 15, 2023 · Updated 2 years ago
allenai / better-promptability
☆11 · Nov 27, 2022 · Updated 3 years ago
tagoyal / factuality-datasets
☆46 · May 26, 2023 · Updated 3 years ago
nlpcl-lab / CADD_dataset
CADD: A Large-scale Comprehensive Abusiveness Detection Dataset with Multifaceted Labels from Reddit
☆12 · Sep 28, 2022 · Updated 3 years ago
meetdavidwan / factpegasus
PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)
☆40 · Sep 15, 2022 · Updated 3 years ago
tingofurro / summac
Codebase, data and models for the SummaC paper in TACL
☆110 · Jan 30, 2025 · Updated last year
neulab / BARTScore
BARTScore: Evaluating Generated Text as Text Generation
☆368 · Jun 27, 2022 · Updated 4 years ago
alexzhou907 / dialogue_evaluation
☆22 · Dec 8, 2022 · Updated 3 years ago
ThomasScialom / QuestEval
☆105 · Mar 4, 2024 · Updated 2 years ago
krandiash / gpt3-nli
Training a model without a dataset for natural language inference (NLI)
☆25 · Aug 3, 2020 · Updated 5 years ago
ibraheem-moosa / mt-ranker
Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking
☆10 · Feb 29, 2024 · Updated 2 years ago
violet-zct / fairseq-dro-mnmt
☆14 · Sep 10, 2021 · Updated 4 years ago