SYSTRAN/fuzzy-match

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SYSTRAN/fuzzy-match)

SYSTRAN / fuzzy-match

Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.

☆54

Alternatives and similar repositories for fuzzy-match

Users that are interested in fuzzy-match are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fyvo / WMT-Biomed-Test
View on GitHub
☆13Aug 23, 2024Updated last year
OpenNMT / nmt-wizard-docker
View on GitHub
Dockerized NMT frameworks for nmt-wizard
☆39Apr 18, 2023Updated 3 years ago
thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆166Apr 13, 2026Updated 3 months ago
AppraiseDev / OCELoT
View on GitHub
Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations
☆23Jul 11, 2026Updated last week
marian-nmt / sotastream
View on GitHub
A library for data streaming and augmentation
☆22May 5, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
rewicks / ersatz
View on GitHub
☆51Jul 25, 2024Updated last year
urvashik / knnmt
View on GitHub
☆45Jun 7, 2021Updated 5 years ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
ymoslem / OpenNMT-Web-Interface
View on GitHub
Machine Translation Web Interface for OpenNMT-py
☆26Dec 24, 2021Updated 4 years ago
rudyerudite / AngErza
View on GitHub
Toy implementation of a Automated Exploit Generation built on Angr; stiched using radare, pwntools, pyelftools, and Angrop.
☆16Jan 9, 2022Updated 4 years ago
vchahun / galechurch
View on GitHub
Gale&Church (1993) sentence alignment
☆16May 9, 2020Updated 6 years ago
facebookresearch / mlqe
View on GitHub
We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…
☆81Aug 31, 2021Updated 4 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated 2 years ago
braunefe / Gargantua
View on GitHub
☆12Dec 9, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenNMT / Tokenizer
View on GitHub
Fast and customizable text tokenization library with BPE and SentencePiece support
☆334Jan 10, 2026Updated 6 months ago
Roxot / mbr-nmt
View on GitHub
Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation
☆16Oct 14, 2022Updated 3 years ago
ZurichNLP / ContraWSD
View on GitHub
Word sense disambiguation test sets for NMT
☆21Dec 3, 2020Updated 5 years ago
robertostling / eflomal
View on GitHub
Efficient Low-Memory Aligner
☆148Jan 15, 2025Updated last year
rsennrich / lingeval97
View on GitHub
☆18Oct 5, 2017Updated 8 years ago
argosopentech / LibreTranslate-cpp
View on GitHub
LibreTranslate C++ bindings
☆19Aug 27, 2021Updated 4 years ago
ise-uiuc / uniapr
View on GitHub
Fast and Precise On-the-fly Patch Validation for All
☆10Feb 24, 2023Updated 3 years ago
RUB-SysSec / PrimGen
View on GitHub
ACSAC 2018 paper: Towards Automated Generation of Exploitation Primitives for Web Browsers
☆15Nov 28, 2018Updated 7 years ago
luismond / tm2tb
View on GitHub
Bilingual term extractor
☆60Nov 19, 2025Updated 8 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MultiPath / Efficient-Neural-Machine-Translation
View on GitHub
PhD thesis (updating) of Jiatao Gu from HKU
☆19Aug 10, 2018Updated 7 years ago
salesforce / localization-xml-mt
View on GitHub
A High-Quality Multilingual Dataset for Structured Documentation Translation
☆39May 1, 2025Updated last year
BrightXiaoHan / optimum-ascend
View on GitHub
Optimized inference with Ascend and Hugging Face
☆12Apr 23, 2024Updated 2 years ago
wmt-conference / wmt-format-tools
View on GitHub
Tools for formatting WMT hypothesis and test sets in XML
☆27Apr 18, 2025Updated last year
NTDXYG / DualSC
View on GitHub
code and data for paper "Automatic Generation and Summarization of Shellcode via Transformer and Dual Learning", which accepted in SANER …
☆12May 8, 2022Updated 4 years ago
rsennrich / Bleualign
View on GitHub
Machine-Translation-based sentence alignment tool for parallel text
☆316Mar 18, 2021Updated 5 years ago
Unbabel / MT-Telescope
View on GitHub
☆33Nov 22, 2021Updated 4 years ago
Cartus / AMR-Parser
View on GitHub
Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)
☆10Jun 13, 2019Updated 7 years ago
hslh / pie-detection
View on GitHub
Automatic Detection of Potentially Idiomatic Expressions
☆12Feb 19, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
translate5 / translate5
View on GitHub
Translate5: Open Source Translation System (published 1st time on github at 2020-08-10)
☆51Updated this week
jcyk / copyisallyouneed
View on GitHub
Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory
☆81Jun 12, 2023Updated 3 years ago
kpu / fasterText
View on GitHub
Library for fast text representation and classification.
☆31Jan 9, 2024Updated 2 years ago
XuezheMax / NeuroNLP
View on GitHub
Deep neural models for core NLP tasks
☆13Nov 9, 2017Updated 8 years ago
MicrosoftTranslator / Translator-HumanParityData
View on GitHub
Human evaluation results and translation output for the Translator Human Parity Data release
☆37Mar 19, 2018Updated 8 years ago
modernmt / modernmt
View on GitHub
Neural Adaptive Machine Translation that adapts to context and learns from corrections.
☆350Jul 7, 2022Updated 4 years ago
lspecia / quest
View on GitHub
Pascal2 Harvest project QuEst
☆14Sep 15, 2014Updated 11 years ago