shadowkiller33 / ParaScore
☆29Updated 2 years ago
Alternatives and similar repositories for ParaScore
Users that are interested in ParaScore are comparing it to the libraries listed below
Sorting:
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- ☆20Updated 4 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆14Updated last year
- ☆58Updated 3 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- Multicultural Proverbs and Sayings☆11Updated 4 months ago
- ☆48Updated 2 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆27Updated 3 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆20Updated last year
- ☆15Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- ☆17Updated last year
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy☆14Updated 3 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆25Updated last year
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆19Updated 4 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)☆31Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆15Updated 2 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆62Updated last year
- ☆82Updated 2 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated 2 months ago
- FRANK: Factuality Evaluation Benchmark☆55Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 3 years ago
- ☆27Updated 5 months ago
- ☆16Updated 2 months ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- ☆32Updated last month
- ☆21Updated last year