Translation Memory Open-source Purifier
☆35Nov 6, 2022Updated 3 years ago
Alternatives and similar repositories for TMOP
Users that are interested in TMOP are comparing it to the libraries listed below
Sorting:
- Best Practices in Translation Memory Management☆47Dec 14, 2018Updated 7 years ago
- Java file conversion utility for converting Trados Studio SDLTM > TMX and SDLTB > CSV☆12Jun 12, 2017Updated 8 years ago
- Web service for implementing a large-scale translation memory☆92Jun 14, 2021Updated 4 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections.☆351Jul 7, 2022Updated 3 years ago
- Data collection, alignment and TAUS repository☆23Nov 30, 2017Updated 8 years ago
- Bilingual sengence aligner☆29Nov 25, 2025Updated 3 months ago
- Bitextor generates translation memories from multilingual websites☆301Nov 11, 2024Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 6 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- This is an unofficial memsource-cli-client project.☆14May 4, 2021Updated 4 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- Decoding platform for machine translation research☆54Aug 24, 2019Updated 6 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Transform TMX to text☆28Nov 23, 2022Updated 3 years ago
- Wagtail CMS quickstart for deployment on PythonAnywhere☆20Feb 22, 2023Updated 3 years ago
- TermitUp is a tool to generate Enriched Linked Terminologies from corpus, extracting knowledge from the Linguistic Linked Open Data cloud…☆22Jun 13, 2023Updated 2 years ago
- Recipes for training OpenNMT systems☆14Jul 26, 2017Updated 8 years ago
- Efficient Low-Memory Aligner☆146Jan 15, 2025Updated last year
- A specification and user manual for the Intento API – a single API to Cognitive AI models from many vendors.☆41Jan 20, 2026Updated 2 months ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- ☆25Jan 22, 2024Updated 2 years ago
- Docker configuration for MateCat web cattool https://github.com/matecat/MateCat☆21Oct 28, 2025Updated 4 months ago
- ☆21May 30, 2022Updated 3 years ago
- Translation Error Rate (TER)☆45May 25, 2018Updated 7 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 4 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11May 30, 2024Updated last year
- scripts used for SMT system submitted to WMT 2014☆12Apr 30, 2017Updated 8 years ago
- Material for a course on Advanced NLP☆14Jul 22, 2025Updated 8 months ago
- ☆42Jul 17, 2018Updated 7 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆392Nov 7, 2023Updated 2 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆81Apr 7, 2020Updated 5 years ago
- The 14th Machine Translation Marathon 2019 in Edinburgh☆13Dec 8, 2022Updated 3 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆66Feb 14, 2026Updated last month
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- Machine-Translation-based sentence alignment tool for parallel text☆315Mar 18, 2021Updated 5 years ago
- A Neural Framework for MT Evaluation☆728Mar 5, 2026Updated 2 weeks ago
- A tool for converting TMX files into bilingual corpora☆19Feb 4, 2020Updated 6 years ago