Unbabel / COMET
A Neural Framework for MT Evaluation
☆501 Updated 3 months ago
Related projects
Alternatives and complementary repositories for COMET
- A neural word aligner based on multilingual BERT ☆328 Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆147 Updated 5 months ago
- ☆218 Updated 5 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆250 Updated last month
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons ☆1,066 Updated 2 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆350 Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆692 Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆150 Updated 4 months ago
- Yet Another Neural Machine Translation Toolkit ☆174 Updated 4 months ago
- Tools for checking ACL paper submissions ☆598 Updated 2 weeks ago
- Facebook Low Resource (FLoRes) MT Benchmark ☆703 Updated 11 months ago
- ☆190 Updated 5 months ago
- Easier Automatic Sentence Simplification Evaluation ☆158 Updated last year
- Transformer based translation quality estimation ☆107 Updated last year
- A tool for holistic analysis of language generation systems ☆466 Updated 2 years ago
- Official style files for papers submitted to venues of the Association for Computational Linguistics ☆730 Updated 5 months ago
- Improved Sentence Alignment in Linear Time and Space ☆163 Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆102 Updated 2 months ago
- GEMBA — GPT Estimation Metric Based Assessment ☆100 Updated 3 months ago
- Python port of Moses tokenizer, truecaser and normalizer ☆487 Updated 5 months ago
- The FLORES+ Machine Translation Benchmark ☆99 Updated 2 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆90 Updated last month
- Efficient Low-Memory Aligner ☆137 Updated 2 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆96 Updated 6 months ago
- Open-Source Machine Translation Quality Estimation in PyTorch ☆228 Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an… ☆269 Updated 2 years ago
- A tool for calculating character n-gram F-score ☆66 Updated last year
- Simple, fast unsupervised word aligner ☆738 Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers. ☆114 Updated 2 weeks ago
- cLang-8 is a dataset for grammatical error correction. ☆102 Updated 2 years ago