Unbabel / COMET
A Neural Framework for MT Evaluation
☆527Updated 2 weeks ago
Alternatives and similar repositories for COMET:
Users that are interested in COMET are comparing it to the libraries listed below
- A neural word aligner based on multilingual BERT☆336Updated 2 years ago
- ☆231Updated 7 months ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆711Updated last year
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,091Updated last week
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆353Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆263Updated this week
- A tool that locates, downloads, and extracts machine translation corpora☆149Updated 7 months ago
- Multilingual sentence alignment using sentence embeddings☆106Updated 2 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆96Updated last month
- Open-Source Machine Translation Quality Estimation in PyTorch☆228Updated 2 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆337Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆152Updated 7 months ago
- Facebook Low Resource (FLoRes) MT Benchmark☆717Updated last year
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- ☆199Updated last week
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆382Updated 6 months ago
- Transformer based translation quality estimation☆107Updated last year
- Tools for checking ACL paper submissions☆608Updated 2 months ago
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- The FLORES+ Machine Translation Benchmark☆100Updated 2 months ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆461Updated 2 years ago
- Tools to download and cleanup Common Crawl data☆980Updated last year
- Python port of Moses tokenizer, truecaser and normalizer☆490Updated 7 months ago
- Efficient Low-Memory Aligner☆140Updated this week
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆290Updated last year
- All-in-one text de-duplication☆648Updated 7 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 8 months ago
- GEMBA — GPT Estimation Metric Based Assessment☆106Updated 5 months ago
- Yet Another Neural Machine Translation Toolkit☆176Updated 6 months ago
- Simple, fast unsupervised word aligner☆742Updated 2 years ago