mjpost/sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

☆1,254

Alternatives and similar repositories for sacrebleu

Users who are interested in sacrebleu are comparing it to the libraries listed below.

hplt-project / sacremoses
Python port of Moses tokenizer, truecaser and normalizer
☆497 | Feb 6, 2026 | Updated 5 months ago
Unbabel / COMET
A Neural Framework for MT Evaluation
☆770 | Apr 21, 2026 | Updated 3 months ago
rsennrich / subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,271 | Aug 7, 2024 | Updated last year
neulab / compare-mt
A tool for holistic analysis of language generation systems
☆471 | Sep 22, 2025 | Updated 10 months ago
moses-smt / mosesdecoder
Moses, the machine translation system
☆1,625 | Mar 28, 2025 | Updated last year
clab / fast_align
Simple, fast unsupervised word aligner
☆769 | Jul 19, 2022 | Updated 4 years ago
facebookresearch / flores
Facebook Low Resource (FLoRes) MT Benchmark
☆771 | Nov 20, 2023 | Updated 2 years ago
google-research / bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆794 | Aug 4, 2023 | Updated 2 years ago
glample / fastBPE
Fast BPE
☆677 | Jun 18, 2024 | Updated 2 years ago
Tiiiger / bert_score
BERT score for text generation
☆1,909 | Jul 30, 2024 | Updated last year
thammegowda / mtdata
A tool that locates, downloads, and extracts machine translation corpora
☆167 | Apr 13, 2026 | Updated 3 months ago
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,978 | Updated this week
THUNLP-MT / MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,435 | Aug 9, 2024 | Updated last year
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆7,010 | Oct 14, 2025 | Updated 9 months ago
marian-nmt / marian
Fast Neural Machine Translation in C++
☆1,462 | Aug 25, 2023 | Updated 2 years ago
facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,923 | Feb 14, 2023 | Updated 3 years ago
rsennrich / wmt16-scripts
scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task
☆139 | Nov 5, 2020 | Updated 5 years ago
facebookresearch / LASER
Language-Agnostic SEntence Representations
☆3,661 | May 2, 2024 | Updated 2 years ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,251 | Sep 30, 2025 | Updated 9 months ago
lilt / alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
☆107 | Nov 2, 2021 | Updated 4 years ago
MorinoseiMorizo / jparacrawl-finetune
An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.
☆105 | Apr 29, 2021 | Updated 5 years ago
neulab / awesome-align
A neural word aligner based on multilingual BERT
☆379 | Mar 10, 2022 | Updated 4 years ago
EdinburghNLP / nematus
Open-Source Neural Machine Translation in Tensorflow
☆805 | Dec 9, 2022 | Updated 3 years ago
Maluuba / nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
☆1,391 | Aug 20, 2024 | Updated last year
bitextor / bitextor
Bitextor generates translation memories from multilingual websites
☆299 | Nov 11, 2024 | Updated last year
thompsonb / prism
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆102 | Jul 25, 2024 | Updated last year
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆115 | Jul 1, 2026 | Updated 3 weeks ago
marian-nmt / marian-examples
Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.
☆81 | Apr 8, 2023 | Updated 3 years ago
jhclark / tercom
Translation Error Rate (TER)
☆44 | May 25, 2018 | Updated 8 years ago
jhclark / multeval
Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …
☆205 | Feb 25, 2023 | Updated 3 years ago
bitextor / bicleaner
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160 | Jun 18, 2024 | Updated 2 years ago
awslabs / sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
☆1,215 | Oct 24, 2024 | Updated last year
Unbabel / OpenKiwi
Open-Source Machine Translation Quality Estimation in PyTorch
☆233 | Jun 23, 2022 | Updated 4 years ago
isl-mt / SLT.KIT
Spoken Language Translation System
☆20 | Jul 26, 2021 | Updated 4 years ago
bicici / FDA
Feature Decay Algorithms
☆11 | Mar 5, 2014 | Updated 12 years ago
joeynmt / joeynmt
Minimalist NMT for educational purposes
☆710 | Jan 29, 2024 | Updated 2 years ago
bzhangGo / zero
Zero -- A neural machine translation system
☆152 | May 8, 2023 | Updated 3 years ago
fe1ixxu / ALMA
State-of-the-art LLM-based translation models.
☆590 | Apr 9, 2025 | Updated last year
m-popovic / chrF
a tool for calculating character n-gram F score
☆80 | Feb 4, 2023 | Updated 3 years ago