neulab/compare-mt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neulab/compare-mt)

neulab / compare-mt

A tool for holistic analysis of language generations systems

☆471

Alternatives and similar repositories for compare-mt

Users that are interested in compare-mt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆166Apr 13, 2026Updated 3 months ago
mjpost / sacrebleu
View on GitHub
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
☆1,253Updated this week
clab / fast_align
View on GitHub
Simple, fast unsupervised word aligner
☆769Jul 19, 2022Updated 4 years ago
facebookresearch / XLM
View on GitHub
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,925Feb 14, 2023Updated 3 years ago
nelson-liu / contextual-repr-analysis
View on GitHub
A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…
☆212Oct 20, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
facebookresearch / flores
View on GitHub
Facebook Low Resource (FLoRes) MT Benchmark
☆771Nov 20, 2023Updated 2 years ago
THUNLP-MT / MT-Reading-List
View on GitHub
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,436Aug 9, 2024Updated last year
lilt / alignment-scripts
View on GitHub
Scripts to preprocess training and test data and to run fast_align and giza
☆107Nov 2, 2021Updated 4 years ago
harvardnlp / pytorch-struct
View on GitHub
Fast, general, and tested differentiable structured prediction in PyTorch
☆1,132Apr 20, 2022Updated 4 years ago
facebookresearch / Mask-Predict
View on GitHub
A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…
☆246Sep 17, 2021Updated 4 years ago
rsennrich / subword-nmt
View on GitHub
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,271Aug 7, 2024Updated last year
hplt-project / sacremoses
View on GitHub
Python port of Moses tokenizer, truecaser and normalizer
☆497Feb 6, 2026Updated 5 months ago
pmichel31415 / teapot-nlp
View on GitHub
Tool for Evaluating Adversarial Perturbations on Text
☆61Feb 27, 2022Updated 4 years ago
ZurichNLP / domain-robustness
View on GitHub
☆13Dec 11, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Helsinki-NLP / OpusFilter
View on GitHub
OpusFilter - Parallel corpus processing toolkit
☆115Jul 1, 2026Updated 3 weeks ago
artetxem / vecmap
View on GitHub
A framework to learn cross-lingual word embedding mappings
☆655Apr 22, 2023Updated 3 years ago
lena-voita / good-translation-wrong-in-context
View on GitHub
This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …
☆101May 12, 2020Updated 6 years ago
facebookresearch / UnsupervisedMT
View on GitHub
Phrase-Based & Neural Unsupervised Machine Translation
☆1,499Sep 15, 2021Updated 4 years ago
ondrejklejch / MT-ComparEval
View on GitHub
Tool for comparison and evaluation of machine translation.
☆56May 17, 2022Updated 4 years ago
glample / fastBPE
View on GitHub
Fast BPE
☆677Jun 18, 2024Updated 2 years ago
artetxem / undreamt
View on GitHub
Unsupervised Neural Machine Translation
☆474Jul 8, 2020Updated 6 years ago
neulab / xnmt
View on GitHub
eXtensible Neural Machine Translation
☆189Sep 22, 2025Updated 10 months ago
mahfuzibnalam / terminology_evaluation
View on GitHub
☆21May 30, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rsennrich / wmt16-scripts
View on GitHub
scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task
☆139Nov 5, 2020Updated 5 years ago
nyu-mll / jiant
View on GitHub
jiant is an nlp toolkit
☆1,675Jul 6, 2023Updated 3 years ago
neulab / extreme-adaptation-for-personalized-translation
View on GitHub
Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"
☆42Sep 22, 2025Updated 10 months ago
bzhangGo / zero
View on GitHub
Zero -- A neural machine translation system
☆152May 8, 2023Updated 3 years ago
neulab / contextual-mt
View on GitHub
A repository with the code related to experiments around context-aware machine translation
☆51Sep 22, 2025Updated 10 months ago
facebookresearch / evaluation-of-nmt-bt
View on GitHub
This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …
☆15Aug 31, 2021Updated 4 years ago
nyu-dl / dl4mt-nonauto
View on GitHub
☆120Feb 20, 2019Updated 7 years ago
marian-nmt / sotastream
View on GitHub
A library for data streaming and augmentation
☆22May 5, 2025Updated last year
facebookresearch / LASER
View on GitHub
Language-Agnostic SEntence Representations
☆3,661May 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
violet-zct / DeMa-BWE
View on GitHub
NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)
☆63Dec 8, 2022Updated 3 years ago
neubig / mtandseq2seq-code
View on GitHub
Code examples for CMU CS11-731, Machine Translation and Sequence-to-sequence Models
☆35Nov 4, 2019Updated 6 years ago
kahne / NonAutoregGenProgress
View on GitHub
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆300Mar 15, 2023Updated 3 years ago
harvardnlp / urnng
View on GitHub
☆179Jul 31, 2020Updated 5 years ago
thompsonb / prism
View on GitHub
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆102Jul 25, 2024Updated last year
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated 2 years ago
microsoft / MASS
View on GitHub
MASS: Masked Sequence to Sequence Pre-training for Language Generation
☆1,116Nov 28, 2022Updated 3 years ago