The FLORES+ Machine Translation Benchmark
☆111Nov 12, 2024Updated last year
Alternatives and similar repositories for flores
Users that are interested in flores are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Seed Machine Translation Data☆33Nov 12, 2024Updated last year
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- ☆254May 30, 2024Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
- A tool that locates, downloads, and extracts machine translation corpora☆163Mar 22, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆134Jan 22, 2026Updated 2 months ago
- ☆14Jan 4, 2021Updated 5 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆127Oct 13, 2025Updated 5 months ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- Jojajovai Guarani-Spanish Parallel Corpus☆19Jul 5, 2022Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- ☆98Sep 25, 2025Updated 6 months ago
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations☆15Apr 14, 2025Updated 11 months ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆21May 30, 2022Updated 3 years ago
- ☆35Jun 15, 2023Updated 2 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- ☆14Oct 6, 2025Updated 5 months ago
- ☆10Mar 22, 2024Updated 2 years ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- ☆18Nov 25, 2022Updated 3 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- simple translate☆12Mar 7, 2020Updated 6 years ago
- GEMBA — GPT Estimation Metric Based Assessment☆146Dec 15, 2025Updated 3 months ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆20Mar 13, 2025Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- State-of-the-art LLM-based translation models.☆583Apr 9, 2025Updated 11 months ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆28Feb 8, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆300Updated this week
- ☆51Jul 25, 2024Updated last year
- Multilingual Open Text☆25May 8, 2025Updated 10 months ago
- machine translation data process tools☆10Apr 29, 2024Updated last year
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23May 4, 2023Updated 2 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year