DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
☆36Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for DiscoScore
Users that are interested in DiscoScore are comparing it to the libraries listed below
Sorting:
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- ☆14Feb 3, 2021Updated 5 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Dataset for Paper "Exploring Content Selection in Summarization of Novel Chapters"☆14Mar 20, 2023Updated 3 years ago
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- ☆16Oct 20, 2025Updated 5 months ago
- Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events☆18Jun 16, 2022Updated 3 years ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- This is a repo for DCQA QUD parsing implemenation☆11Aug 5, 2025Updated 7 months ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆213Nov 20, 2023Updated 2 years ago
- Code for the paper "Do Massively Pretrained Language Models Make Better Storytellers?"☆75Jun 17, 2022Updated 3 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)☆33Jun 6, 2022Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- ☆12Mar 13, 2025Updated last year
- ☆12Jun 29, 2025Updated 8 months ago
- ☆11May 9, 2023Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- ☆29Dec 2, 2024Updated last year
- Paper list for open-ended language generation☆191Nov 17, 2022Updated 3 years ago
- ☆32Nov 16, 2021Updated 4 years ago
- Witwicky: An implementation of Transformer in PyTorch.☆22Aug 17, 2020Updated 5 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- A reference-free metric for measuring summary quality, learned from human ratings.☆43Dec 8, 2022Updated 3 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Apr 18, 2024Updated last year
- ☆12Feb 25, 2023Updated 3 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- Benchmark for evaluating open-ended generation☆51Nov 6, 2024Updated last year
- Dialog Acts SEGmentation: Tools for dialog act research☆14Mar 21, 2025Updated last year
- ☆11Jul 6, 2024Updated last year
- UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation☆59Oct 13, 2020Updated 5 years ago
- Sentence Embeddings used in the GermEval-2017 Submission☆13May 23, 2023Updated 2 years ago
- API client for fetching and comparing passages from legislation☆14Jan 26, 2025Updated last year
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Dec 8, 2022Updated 3 years ago
- Automated Semantic Analysis of Discourse Markers☆11May 30, 2022Updated 3 years ago