The FLORES+ Machine Translation Benchmark
☆112 · Nov 12, 2024 · Updated last year
Alternatives and similar repositories for flores
Users that are interested in flores are comparing it to the libraries listed below.
- NTREX -- News Test References for MT Evaluation ☆87 · Jun 5, 2024 · Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark ☆766 · Nov 20, 2023 · Updated 2 years ago
- ☆254 · May 30, 2024 · Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 8 months ago
- OpusFilter - Parallel corpus processing toolkit ☆115 · Apr 8, 2026 · Updated last week
- A tool that locates, downloads, and extracts machine translation corpora ☆163 · Updated this week
- ☆139 · Apr 8, 2026 · Updated last week
- ☆14 · Jan 4, 2021 · Updated 5 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆128 · Oct 13, 2025 · Updated 6 months ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation ☆37 · May 1, 2025 · Updated 11 months ago
- Jojajovai Guarani-Spanish Parallel Corpus ☆20 · Jul 5, 2022 · Updated 3 years ago
- Scripts to create speech corpora from open.bible ☆13 · Jan 3, 2022 · Updated 4 years ago
- ☆98 · Sep 25, 2025 · Updated 6 months ago
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024] ☆15 · Apr 3, 2026 · Updated 2 weeks ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- ☆21 · May 30, 2022 · Updated 3 years ago
- Feature Decay Algorithms ☆11 · Mar 5, 2014 · Updated 12 years ago
- ☆14 · Oct 6, 2025 · Updated 6 months ago
- ☆10 · Mar 22, 2024 · Updated 2 years ago
- A Chinese-native, level-graded benchmark for coding ability ☆15 · Apr 11, 2024 · Updated 2 years ago
- ☆18 · Nov 25, 2022 · Updated 3 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026) ☆17 · Feb 27, 2026 · Updated last month
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 10 months ago
- GEMBA - GPT Estimation Metric Based Assessment ☆147 · Dec 15, 2025 · Updated 4 months ago
- ParCourE - Parallel Corpus Explorer ☆12 · Dec 27, 2021 · Updated 4 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines ☆20 · Mar 13, 2025 · Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 · Jun 18, 2024 · Updated last year
- State-of-the-art LLM-based translation models. ☆584 · Apr 9, 2025 · Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆29 · Feb 8, 2023 · Updated 3 years ago
- [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages ☆106 · Updated this week
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆302 · Updated this week
- ☆51 · Jul 25, 2024 · Updated last year
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric … ☆83 · Sep 21, 2023 · Updated 2 years ago
- Multilingual Open Text ☆25 · May 8, 2025 · Updated 11 months ago
- machine translation data process tools ☆10 · Apr 29, 2024 · Updated last year
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian ☆23 · May 4, 2023 · Updated 2 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset" ☆12 · Dec 8, 2024 · Updated last year
- ☆20 · Mar 12, 2025 · Updated last year
- ☆272 · Aug 1, 2025 · Updated 8 months ago