Unbabel / COMET
A Neural Framework for MT Evaluation
☆508Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for COMET
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆251Updated last month
- ☆221Updated 5 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆697Updated last year
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,068Updated 3 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆351Updated last year
- Tools for checking ACL paper submissions☆598Updated last month
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 5 months ago
- The FLORES+ Machine Translation Benchmark☆99Updated last week
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆90Updated last month
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- BLEURT implementation in PyTorch☆30Updated last year
- Transformer based translation quality estimation☆107Updated last year
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- ☆315Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 3 months ago
- Simple, fast unsupervised word aligner☆738Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆114Updated last month
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆269Updated 2 years ago
- GEMBA — GPT Estimation Metric Based Assessment☆102Updated 3 months ago
- Machine Translation (MT) Preparation Scripts☆32Updated 3 months ago
- ☆193Updated this week
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 7 months ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆228Updated 2 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆285Updated last year
- State-of-the-art LLM-based translation models.☆437Updated last month
- Efficient Low-Memory Aligner☆139Updated 2 months ago
- Tools to download and cleanup Common Crawl data☆971Updated last year