rewicks/ersatz

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rewicks/ersatz)

rewicks / ersatz

☆51

Alternatives and similar repositories for ersatz

Users that are interested in ersatz are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

joshua-decoder / thrax
View on GitHub
Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation
☆15Dec 2, 2016Updated 9 years ago
antonisa / embeddings
View on GitHub
Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages
☆15Apr 11, 2020Updated 6 years ago
sarapapi / hearing2translate
View on GitHub
A unified evaluation suite for speech-to-text translation, covering SpeechLLMs, SFMs, and cascaded systems across diverse real-world spee…
☆32Apr 25, 2026Updated 2 months ago
SAP / software-documentation-data-set-for-machine-translation
View on GitHub
A parallel evaluation data set of SAP software documentation with document structure annotation
☆15Jun 12, 2026Updated last month
SYSTRAN / fuzzy-match
View on GitHub
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
☆54Apr 22, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
shyyhs / CourseraParallelCorpusMining
View on GitHub
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15Aug 27, 2024Updated last year
thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆165Apr 13, 2026Updated 3 months ago
Unbabel / MT-Telescope
View on GitHub
☆33Nov 22, 2021Updated 4 years ago
Yao-Dou / LENS
View on GitHub
☆25May 11, 2024Updated 2 years ago
robertostling / eflomal
View on GitHub
Efficient Low-Memory Aligner
☆148Jan 15, 2025Updated last year
Unbabel / OpenKiwi
View on GitHub
Open-Source Machine Translation Quality Estimation in PyTorch
☆233Jun 23, 2022Updated 4 years ago
paracrawl / keops
View on GitHub
Tool for manual evaluation of parallel sentences.
☆15Jan 26, 2026Updated 5 months ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
braunefe / Gargantua
View on GitHub
☆12Dec 9, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BUTSpeechFIT / hystoc
View on GitHub
Getting confidences from any end-to-end systems
☆11May 24, 2023Updated 3 years ago
stickeritis / sticker2
View on GitHub
Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot
☆13Dec 18, 2020Updated 5 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated 2 years ago
zouharvi / pearmut
View on GitHub
Platform for Evaluating and Reviewing of Multilingual Tasks
☆32Updated this week
sld / torch-conv-ner
View on GitHub
Deep learning for named entity recognition on CoNLL-2003
☆10Dec 23, 2016Updated 9 years ago
AppraiseDev / OCELoT
View on GitHub
Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations
☆23Jul 11, 2026Updated last week
salesforce / localization-xml-mt
View on GitHub
A High-Quality Multilingual Dataset for Structured Documentation Translation
☆39May 1, 2025Updated last year
sortiz / tmxt
View on GitHub
Transform TMX to text
☆27Nov 23, 2022Updated 3 years ago
EdinburghNLP / opus-100-corpus
View on GitHub
☆93Feb 13, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mayhewsw / multilingual-data-stats
View on GitHub
Statistics on multilingual datasets
☆17Jul 12, 2022Updated 4 years ago
czcorpus / InterText_server
View on GitHub
Collaborative on-line editor for aligned parallel texts.
☆14Jul 2, 2026Updated 2 weeks ago
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
encukou / czech-sort
View on GitHub
Python tool for simple Czech alphabetization
☆14Jul 12, 2023Updated 3 years ago
neulab / contextual-mt
View on GitHub
A repository with the code related to experiments around context-aware machine translation
☆51Sep 22, 2025Updated 9 months ago
valentinhofmann / flota
View on GitHub
☆18Feb 1, 2023Updated 3 years ago
naver / nllb-pruning
View on GitHub
Library for pruning experts per language pair in NLLB-200
☆35Jul 7, 2023Updated 3 years ago
jhclark / ducttape
View on GitHub
A workflow management system for researchers who heart Unix.
☆127Sep 23, 2015Updated 10 years ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
minimaxir / char-tsne-visualization
View on GitHub
Visualizations of character embeddings from derived character vectors.
☆13Apr 4, 2017Updated 9 years ago
G-Research / fast-string-search
View on GitHub
☆13Apr 13, 2021Updated 5 years ago
TharinduDR / TransQuest
View on GitHub
Transformer based translation quality estimation
☆114Jul 20, 2023Updated 3 years ago
lirondos / lazaro
View on GitHub
An observatory of anglicism usage in the Spanish press
☆11Updated this week
sheffieldnlp / highres
View on GitHub
☆16Dec 10, 2022Updated 3 years ago
thompsonb / prism
View on GitHub
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆102Jul 25, 2024Updated last year
jhdeov / ArmenianVerbs
View on GitHub
Paradigms of Armenian conjugation classes, and sample verb list
☆17Apr 13, 2022Updated 4 years ago