Helsinki-NLP/OPUS-MT-testsets

benchmarks for evaluating MT models

☆11

Alternatives and similar repositories for OPUS-MT-testsets

Users who are interested in OPUS-MT-testsets are comparing it to the repositories listed below.

Veridise / zk-language-comparison
Examples of Mastermind implemented in different ZK languages and frameworks.
☆17 · Mar 26, 2025 · Updated last year
salesforce / simplification
☆23 · Jun 25, 2026 · Updated last month
ymoslem / MT-Tools
Collection of Common Machine Translation Tools
☆11 · Jul 26, 2022 · Updated 4 years ago
proycon / spacy2folia
Use spaCy for NLP and output to the FoLiA XML format.
☆12 · Feb 27, 2024 · Updated 2 years ago
nateraw / hf-text-classification
☆12 · Feb 17, 2021 · Updated 5 years ago
huggingface / hf_benchmarks
A starter kit for evaluating benchmarks on the 🤗 Hub
☆18 · Apr 8, 2026 · Updated 3 months ago
K-Francis-H / libretranslate-unofficial-ff-extension
An unofficial Firefox extension for the LibreTranslate API
☆14 · Dec 2, 2022 · Updated 3 years ago
lawl / translate
☆12 · Mar 27, 2022 · Updated 4 years ago
brendel-group / objects-compositional-generalization
Official code for the paper "Provable Compositional Generalization for Object-Centric Learning" (ICLR 2024, oral)
☆16 · Aug 26, 2024 · Updated last year
gmftbyGMFTBY / MomentumDecoding
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19 · Jan 27, 2023 · Updated 3 years ago
BayBenj / english-syllabifier
Tool for parsing English phonemes into syllables.
☆10 · Jan 15, 2018 · Updated 8 years ago
mrvoh / meta_learning_multilingual_doc_classification
Placeholder repository
☆15 · Mar 16, 2022 · Updated 4 years ago
lt3 / nfr
Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…
☆12 · Aug 14, 2024 · Updated last year
vsiivola / variKN
A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…
☆42 · Sep 6, 2025 · Updated 10 months ago
wjbmattingly / hobbit-spacy
☆23 · Aug 13, 2023 · Updated 2 years ago
Pleias / Pleias-Rag
☆17 · Feb 25, 2025 · Updated last year
BramVanroy / spacy_download
Download and load spaCy models on-the-fly
☆15 · Feb 9, 2023 · Updated 3 years ago
egorsmkv / NLLB-Translator
☆16 · Oct 28, 2022 · Updated 3 years ago
wietsedv / xpos
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
☆19 · May 17, 2022 · Updated 4 years ago
icoxfog417 / acl-anthology
Script to get ACL Anthology
☆16 · Jan 2, 2025 · Updated last year
Helsinki-NLP / OPUS-CAT
OPUS-CAT is a collection of software which makes it possible to use OPUS-MT neural machine translation models in professional translation. OPU…
☆85 · Feb 4, 2025 · Updated last year
dreamATD / pianist-gnark
The implementation of Pianist (a distributed variant of Plonk) based on gnark.
☆40 · Sep 7, 2023 · Updated 2 years ago
m-decoster / fpt4slt
Frozen Pretrained Transformers for Neural Sign Language Translation
☆15 · Apr 23, 2022 · Updated 4 years ago
aws-samples / llm-evaluation-methodology
☆47 · Mar 26, 2026 · Updated 4 months ago
SapienzaNLP / xl-amr
XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).
☆17 · Jul 25, 2024 · Updated 2 years ago
NoUnique / pymecab-ko
🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3
☆25 · Sep 23, 2025 · Updated 10 months ago
Pleias / toxic-commons
The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.
☆22 · Jul 17, 2026 · Updated last week
luyaojie / chinese-nlp-conference-resource
☆30 · Dec 24, 2019 · Updated 6 years ago
THU-KEG / COPEN
The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".
☆21 · Mar 9, 2023 · Updated 3 years ago
hey-yahei / OpSummary.MXNet
A tool to count operators and parameters of your MXNet-Gluon model.
☆22 · Apr 15, 2020 · Updated 6 years ago
zhangfh-cq / android-translator
Android course project: the 翻译君 (Translator) app
☆16 · Dec 5, 2025 · Updated 7 months ago
Liushiyu-0709 / BAPO-Reliable-Search
[ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"
☆31 · Apr 23, 2026 · Updated 3 months ago
SYSTRAN / similarity
Bilingual sentence similarity classifier using Tensorflow
☆24 · Sep 26, 2019 · Updated 6 years ago
szza / yolov3_gluoncv_MXNet
The yolov3 code from gluoncv, an open-source MXNet project, annotated with comments in Chinese
☆18 · Sep 8, 2019 · Updated 6 years ago
modernmt / DataCollection
Data collection, alignment and TAUS repository
☆24 · Nov 30, 2017 · Updated 8 years ago
LEL-A / GerAlpacaDataCleaned
German Alpaca Dataset (Cleaned + Translated)
☆26 · Apr 6, 2023 · Updated 3 years ago
TsinghuaAI / CPM-1-Pretrain
Pretrain CPM-1
☆53 · Apr 20, 2021 · Updated 5 years ago
shiqimei / CMake-Tutorial
CMake tutorial in Chinese
☆21 · Jan 29, 2019 · Updated 7 years ago
google-research / metricx
☆146 · Jul 2, 2026 · Updated 3 weeks ago