Yixiao-Song / VeriScore
☆33 · Dec 17, 2025 · Updated last month
Alternatives and similar repositories for VeriScore
Users who are interested in VeriScore are comparing it to the libraries listed below.
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation · ☆17 · Nov 29, 2022 · Updated 3 years ago
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models" · ☆12 · Feb 10, 2025 · Updated last year
- FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu… · ☆13 · Apr 25, 2024 · Updated last year
- ☆12 · Sep 1, 2021 · Updated 4 years ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning · ☆50 · Oct 23, 2025 · Updated 3 months ago
- ☆16 · Dec 10, 2022 · Updated 3 years ago
- FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025) · ☆14 · Jul 14, 2025 · Updated 6 months ago
- ☆15 · Aug 3, 2021 · Updated 4 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions" · ☆16 · Nov 5, 2024 · Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). · ☆20 · May 14, 2022 · Updated 3 years ago
- [EMNLP-2025] R1-Zero on ANY TASK · ☆27 · Nov 9, 2025 · Updated 3 months ago
- ☆21 · Jul 28, 2022 · Updated 3 years ago
- ☆55 · Mar 27, 2023 · Updated 2 years ago
- ☆54 · Oct 24, 2024 · Updated last year
- Codebase for LLM Textual Hallucination Benchmark · ☆73 · Apr 25, 2025 · Updated 9 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) · ☆63 · Dec 25, 2023 · Updated 2 years ago
- ☆29 · Dec 2, 2024 · Updated last year
- ☆71 · Nov 27, 2024 · Updated last year
- ☆39 · Jun 7, 2023 · Updated 2 years ago
- This repository provides the dataset used in "Schema-Guided Natural Language Generation" by Yuheng Du, Shereen Oraby, Vittorio Perera, Mi… · ☆13 · Dec 8, 2020 · Updated 5 years ago
- A python implementation of PSNR that takes the Human visual system into account. · ☆12 · Jul 6, 2023 · Updated 2 years ago
- ☆23 · Oct 31, 2025 · Updated 3 months ago
- ☆17 · May 3, 2025 · Updated 9 months ago
- Code for Learning idiolectal style variation in online register · ☆10 · May 18, 2023 · Updated 2 years ago
- Extend bert-nmt to context-aware translation. · ☆11 · May 24, 2021 · Updated 4 years ago
- ☆13 · Sep 26, 2024 · Updated last year
- Codes for "Benchmarking the Generation of Fact Checking Explanations" · ☆10 · Aug 16, 2024 · Updated last year
- Fair paper matching · ☆11 · Jan 20, 2020 · Updated 6 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation · ☆14 · Mar 29, 2025 · Updated 10 months ago
- ☆10 · May 27, 2024 · Updated last year
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/… · ☆10 · Jun 21, 2023 · Updated 2 years ago
- Website for release of TellMeWhy dataset for why question answering · ☆14 · Nov 11, 2022 · Updated 3 years ago
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable … · ☆16 · Dec 14, 2023 · Updated 2 years ago
- ☆12 · Jul 25, 2023 · Updated 2 years ago
- ☆12 · Oct 17, 2024 · Updated last year
- Wenzhou-Kean University AI-LAB · ☆10 · Jun 6, 2022 · Updated 3 years ago
- https://arxiv.org/abs/2404.10917 · ☆14 · Mar 18, 2025 · Updated 10 months ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" · ☆12 · Sep 12, 2023 · Updated 2 years ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on … · ☆13 · Mar 25, 2024 · Updated last year