Bilingual term extractor
☆59Nov 19, 2025Updated 6 months ago
Alternatives and similar repositories for tm2tb
Users that are interested in tm2tb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Translation Memory Open-source Purifier☆35Nov 6, 2022Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Apr 13, 2026Updated last month
- ☆21Feb 13, 2023Updated 3 years ago
- Self-managed translation project interface☆15Updated this week
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- Calculates the word error rate of two strings, and the result is written into beautify HTML.☆19Mar 19, 2020Updated 6 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆233Jun 23, 2022Updated 3 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆52Apr 22, 2025Updated last year
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Efficient teacher-student models and scripts to make them☆57Dec 16, 2023Updated 2 years ago
- Curriculum training☆22Jun 25, 2025Updated 11 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆16Apr 18, 2024Updated 2 years ago
- Neural macine translation soft alignment visualisations for web and command line☆73Aug 19, 2021Updated 4 years ago
- OPUS-CAT is a collection of software which make it possible to OPUS-MT neural machine translation models in professional translation. OPU…☆84Feb 4, 2025Updated last year
- Ocelot is an Open Source XLIFF+ITS 2.0 Editor☆21Jun 15, 2021Updated 4 years ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 8 months ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 6 months ago
- ☆13Dec 11, 2020Updated 5 years ago
- Translate5: Open Source Translation System (published 1st time on github at 2020-08-10)☆50May 13, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆40Jul 14, 2020Updated 5 years ago
- ☆34Nov 22, 2021Updated 4 years ago
- Terminology EXtraction and Text Analytics (TEXTA) Toolkit☆35Nov 17, 2022Updated 3 years ago
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, …☆24Apr 8, 2022Updated 4 years ago
- TextractAI: Extract and process text from PDFs using Python, OpenAI API, and OCR techniques.☆14Mar 23, 2024Updated 2 years ago
- The FLORES+ Machine Translation Benchmark☆112Nov 12, 2024Updated last year
- A two-dimensional vector library written in ES6.☆13Aug 23, 2015Updated 10 years ago
- Improved Sentence Alignment in Linear Time and Space☆194Mar 6, 2023Updated 3 years ago
- String Distance using cython☆13Jan 19, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆36Jun 29, 2025Updated 10 months ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆48Feb 19, 2025Updated last year
- GEMBA — GPT Estimation Metric Based Assessment☆149Dec 15, 2025Updated 5 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆130Apr 23, 2026Updated last month
- OpusFilter - Parallel corpus processing toolkit☆115May 13, 2026Updated last week
- Explore your own text collection with a topic model – without prior knowledge.☆67Mar 19, 2026Updated 2 months ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆42Dec 19, 2023Updated 2 years ago