Helsinki-NLP/Tatoeba-Challenge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Helsinki-NLP/Tatoeba-Challenge)

Helsinki-NLP / Tatoeba-Challenge

☆855

Alternatives and similar repositories for Tatoeba-Challenge

Users that are interested in Tatoeba-Challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Helsinki-NLP / OpusFilter
View on GitHub
OpusFilter - Parallel corpus processing toolkit
☆115Jul 1, 2026Updated 3 weeks ago
Helsinki-NLP / OPUS-MT-train
View on GitHub
Training open neural machine translation models
☆404Jan 17, 2026Updated 6 months ago
EdinburghNLP / opus-100-corpus
View on GitHub
☆93Feb 13, 2024Updated 2 years ago
Helsinki-NLP / Opus-MT
View on GitHub
Open neural machine translation models and web services
☆836Feb 23, 2026Updated 5 months ago
facebookresearch / flores
View on GitHub
Facebook Low Resource (FLoRes) MT Benchmark
☆771Nov 20, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Helsinki-NLP / OPUS-MT-testsets
View on GitHub
benchmarks for evaluating MT models
☆11Jun 26, 2024Updated 2 years ago
Unbabel / COMET
View on GitHub
A Neural Framework for MT Evaluation
☆770Apr 21, 2026Updated 3 months ago
facebookresearch / LASER
View on GitHub
Language-Agnostic SEntence Representations
☆3,661May 2, 2024Updated 2 years ago
thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆167Apr 13, 2026Updated 3 months ago
mjpost / sacrebleu
View on GitHub
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
☆1,254Jul 17, 2026Updated last week
neulab / awesome-align
View on GitHub
A neural word aligner based on multilingual BERT
☆379Mar 10, 2022Updated 4 years ago
robertostling / eflomal
View on GitHub
Efficient Low-Memory Aligner
☆148Jan 15, 2025Updated last year
bzhangGo / zero
View on GitHub
Zero -- A neural machine translation system
☆152May 8, 2023Updated 3 years ago
clab / fast_align
View on GitHub
Simple, fast unsupervised word aligner
☆769Jul 19, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Unbabel / OpenKiwi
View on GitHub
Open-Source Machine Translation Quality Estimation in PyTorch
☆233Jun 23, 2022Updated 4 years ago
marian-nmt / marian
View on GitHub
Fast Neural Machine Translation in C++
☆1,462Aug 25, 2023Updated 2 years ago
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,253Sep 30, 2025Updated 9 months ago
cisnlp / simalign
View on GitHub
[EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
☆398Nov 7, 2023Updated 2 years ago
google-research / bleurt
View on GitHub
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆794Aug 4, 2023Updated 2 years ago
UKPLab / EasyNMT
View on GitHub
Easy to use, state-of-the-art Neural Machine Translation for 100+ languages
☆1,260Dec 21, 2023Updated 2 years ago
glample / fastBPE
View on GitHub
Fast BPE
☆677Jun 18, 2024Updated 2 years ago
wmt-conference / wmt21-news-systems
View on GitHub
☆26Jan 9, 2023Updated 3 years ago
google-research / xtreme
View on GitHub
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651Jan 4, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / XLM
View on GitHub
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,923Feb 14, 2023Updated 3 years ago
thompsonb / prism
View on GitHub
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆102Jul 25, 2024Updated 2 years ago
THUNLP-MT / MT-Reading-List
View on GitHub
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,435Aug 9, 2024Updated last year
google / sentencepiece
View on GitHub
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,983Updated this week
rsennrich / subword-nmt
View on GitHub
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,271Aug 7, 2024Updated last year
hplt-project / sacremoses
View on GitHub
Python port of Moses tokenizer, truecaser and normalizer
☆497Feb 6, 2026Updated 5 months ago
jwieting / paraphrastic-representations-at-scale
View on GitHub
☆74Jul 2, 2021Updated 5 years ago
neulab / compare-mt
View on GitHub
A tool for holistic analysis of language generations systems
☆471Sep 22, 2025Updated 10 months ago
google-research / multilingual-t5
View on GitHub
☆1,294Dec 15, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MicrosoftTranslator / ToShipOrNotToShip
View on GitHub
☆19Dec 16, 2024Updated last year
M4t1ss / SoftAlignments
View on GitHub
Neural macine translation soft alignment visualisations for web and command line
☆73Aug 19, 2021Updated 4 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated 2 years ago
TharinduDR / TransQuest
View on GitHub
Transformer based translation quality estimation
☆114Jul 20, 2023Updated 3 years ago
Helsinki-NLP / OPUS-translator
View on GitHub
Translation demonstrator
☆37May 12, 2020Updated 6 years ago
KelleyYin / XLM-Plus
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
bitextor / bitextor
View on GitHub
Bitextor generates translation memories from multilingual websites
☆299Nov 11, 2024Updated last year