Bilingual term extractor
☆59Nov 19, 2025Updated 3 months ago
Alternatives and similar repositories for tm2tb
Users that are interested in tm2tb are comparing it to the libraries listed below
Sorting:
- Translation Memory Open-source Purifier☆35Nov 6, 2022Updated 3 years ago
- Deployable MT engine☆21Dec 6, 2025Updated 2 months ago
- ☆21May 30, 2022Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Open information and community for machine translation☆81Updated this week
- Best Practices in Translation Memory Management☆47Dec 14, 2018Updated 7 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- Curriculum training☆22Jun 25, 2025Updated 8 months ago
- Calculates the word error rate of two strings, and the result is written into beautify HTML.☆19Mar 19, 2020Updated 5 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Aug 19, 2021Updated 4 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆51Apr 22, 2025Updated 10 months ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, …☆23Apr 8, 2022Updated 3 years ago
- Ocelot is an Open Source XLIFF+ITS 2.0 Editor☆22Jun 15, 2021Updated 4 years ago
- Random Turkish name generator with realistic probabilities.☆20Feb 14, 2021Updated 5 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆59Jul 9, 2021Updated 4 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆126Oct 13, 2025Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated 3 weeks ago
- ☆25Jun 25, 2019Updated 6 years ago
- This is the place where we develop and maintain most of plugins for Trados Studio. If you want to help us or just looking for some exampl…☆143Updated this week
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- ☆17Feb 21, 2026Updated last week
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago
- ☆34Nov 22, 2021Updated 4 years ago
- ☆63Jan 25, 2024Updated 2 years ago
- OPUS-CAT is a collection of software which make it possible to OPUS-MT neural machine translation models in professional translation. OPU…☆84Feb 4, 2025Updated last year
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- Self-managed translation project interface☆15Updated this week
- ☆34Nov 29, 2016Updated 9 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36May 18, 2023Updated 2 years ago
- Terminology EXtraction and Text Analytics (TEXTA) Toolkit☆35Nov 17, 2022Updated 3 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- Reinforcement Learning Recommender System suggesting relevant scientific services to appropriate researchers☆11Aug 29, 2024Updated last year
- COMET for African languages☆10Jan 24, 2025Updated last year