BLEURT is a metric for Natural Language Generation based on transfer learning.
☆786Aug 4, 2023Updated 2 years ago
Alternatives and similar repositories for bleurt
Users that are interested in bleurt are comparing it to the libraries listed below
Sorting:
- BERT score for text generation☆1,873Jul 30, 2024Updated last year
- A Neural Framework for MT Evaluation☆717Feb 5, 2026Updated 3 weeks ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆211Nov 20, 2023Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆651Jan 4, 2023Updated 3 years ago
- ☆98Sep 25, 2025Updated 5 months ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,227Jan 12, 2026Updated last month
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆367Jun 27, 2022Updated 3 years ago
- LAnguage Model Analysis☆1,390Jul 7, 2024Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,490Jan 14, 2026Updated last month
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆126Oct 13, 2025Updated 4 months ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,752Feb 20, 2026Updated last week
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆411Jun 23, 2024Updated last year
- New dataset☆311Aug 31, 2021Updated 4 years ago
- A tool for holistic analysis of language generations systems☆471Sep 22, 2025Updated 5 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,924Feb 14, 2023Updated 3 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98May 12, 2020Updated 5 years ago
- Adversarial Natural Language Inference Benchmark☆398May 12, 2022Updated 3 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- Library for Knowledge Intensive Language Tasks☆967Mar 31, 2022Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,188Feb 8, 2023Updated 3 years ago
- ☆329Jun 7, 2021Updated 4 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,265Aug 7, 2024Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆765Nov 20, 2023Updated 2 years ago
- Simple, fast unsupervised word aligner☆767Jul 19, 2022Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Mar 23, 2024Updated last year
- ☆604Feb 20, 2026Updated last week
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Oct 20, 2021Updated 4 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆345Nov 11, 2024Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆389Nov 7, 2023Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Aug 13, 2020Updated 5 years ago
- Evaluation code for various unsupervised automated metrics for Natural Language Generation.☆1,391Aug 20, 2024Updated last year
- Language-Agnostic SEntence Representations☆3,659May 2, 2024Updated last year
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190May 23, 2025Updated 9 months ago
- Resources for the MRQA 2019 Shared Task☆294Aug 5, 2021Updated 4 years ago
- Code for using and evaluating SpanBERT.☆904Jul 25, 2023Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,123Apr 20, 2022Updated 3 years ago