The FLORES+ Machine Translation Benchmark
☆111Nov 12, 2024Updated last year
Alternatives and similar repositories for flores
Users that are interested in flores are comparing it to the libraries listed below
Sorting:
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆766Nov 20, 2023Updated 2 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- ☆254May 30, 2024Updated last year
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆126Oct 13, 2025Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated 3 weeks ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- Jojajovai Guarani-Spanish Parallel Corpus☆19Jul 5, 2022Updated 3 years ago
- ☆21May 30, 2022Updated 3 years ago
- ☆134Jan 22, 2026Updated last month
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆98Sep 25, 2025Updated 5 months ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- machine translation data process tools☆10Apr 29, 2024Updated last year
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last week
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- ☆10Mar 22, 2024Updated last year
- ☆15Oct 4, 2024Updated last year
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Security Council resolutions in XML AKN4UN format☆17Updated this week
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆297Updated this week
- GEMBA — GPT Estimation Metric Based Assessment☆146Dec 15, 2025Updated 2 months ago
- ☆267Aug 1, 2025Updated 7 months ago
- Post-editing Datasets by Rakuten (PEDRa)☆14Jun 23, 2021Updated 4 years ago
- simple translate☆12Mar 7, 2020Updated 6 years ago
- ☆19Sep 16, 2025Updated 5 months ago
- Library for pruning experts per language pair in NLLB-200☆34Jul 7, 2023Updated 2 years ago
- Bilingual term extractor☆59Nov 19, 2025Updated 3 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆36Jun 7, 2025Updated 9 months ago
- ☆35Jun 15, 2023Updated 2 years ago
- A Neural Framework for MT Evaluation☆720Mar 2, 2026Updated last week
- A library for data streaming and augmentation☆21May 5, 2025Updated 10 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last week
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆17Nov 29, 2022Updated 3 years ago