BLEURT is a metric for Natural Language Generation based on transfer learning.
☆788Aug 4, 2023Updated 2 years ago
Alternatives and similar repositories for bleurt
Users that are interested in bleurt are comparing it to the libraries listed below
Sorting:
- BERT score for text generation☆1,882Jul 30, 2024Updated last year
- A Neural Framework for MT Evaluation☆728Mar 5, 2026Updated 2 weeks ago
- ☆98Sep 25, 2025Updated 5 months ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆213Nov 20, 2023Updated 2 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,229Jan 12, 2026Updated 2 months ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆367Jun 27, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆652Jan 4, 2023Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,494Jan 14, 2026Updated 2 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆127Oct 13, 2025Updated 5 months ago
- BLEURT implementation in PyTorch☆37Jan 19, 2023Updated 3 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆412Jun 23, 2024Updated last year
- Shared repository for open-sourced projects from the Google AI Language team.☆1,760Updated this week
- A tool for holistic analysis of language generations systems☆471Sep 22, 2025Updated 6 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,927Feb 14, 2023Updated 3 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98May 12, 2020Updated 5 years ago
- LAnguage Model Analysis☆1,390Jul 7, 2024Updated last year
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,267Aug 7, 2024Updated last year
- Simple, fast unsupervised word aligner☆767Jul 19, 2022Updated 3 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- Adversarial Natural Language Inference Benchmark☆399May 12, 2022Updated 3 years ago
- Facebook Low Resource (FLoRes) MT Benchmark☆766Nov 20, 2023Updated 2 years ago
- ☆329Jun 7, 2021Updated 4 years ago
- Conditional Transformer Language Model for Controllable Generation☆1,884May 1, 2025Updated 10 months ago
- Longformer: The Long-Document Transformer☆2,189Feb 8, 2023Updated 3 years ago
- Library for Knowledge Intensive Language Tasks☆970Mar 31, 2022Updated 3 years ago
- ☆53Apr 29, 2020Updated 5 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆495Feb 6, 2026Updated last month
- A neural word aligner based on multilingual BERT☆374Mar 10, 2022Updated 4 years ago
- Evaluation code for various unsupervised automated metrics for Natural Language Generation.☆1,391Aug 20, 2024Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆392Nov 7, 2023Updated 2 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Oct 20, 2021Updated 4 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,370Mar 23, 2024Updated 2 years ago
- ☆604Mar 12, 2026Updated last week
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,124Apr 20, 2022Updated 3 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆345Nov 11, 2024Updated last year