Bilingual term extractor
☆59Nov 19, 2025Updated 4 months ago
Alternatives and similar repositories for tm2tb
Users that are interested in tm2tb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 5, 2022Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Updated this week
- ☆21Feb 13, 2023Updated 3 years ago
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Open-Source Machine Translation Quality Estimation in PyTorch☆233Jun 23, 2022Updated 3 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆52Apr 22, 2025Updated 11 months ago
- Efficient teacher-student models and scripts to make them☆55Dec 16, 2023Updated 2 years ago
- Curriculum training☆22Jun 25, 2025Updated 9 months ago
- Best Practices in Translation Memory Management☆47Dec 14, 2018Updated 7 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆301Nov 11, 2024Updated last year
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Apr 18, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Transformer based translation quality estimation☆114Jul 20, 2023Updated 2 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Aug 19, 2021Updated 4 years ago
- Ocelot is an Open Source XLIFF+ITS 2.0 Editor☆22Jun 15, 2021Updated 4 years ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆42Oct 13, 2022Updated 3 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 4 months ago
- ☆13Dec 11, 2020Updated 5 years ago
- Translate5: Open Source Translation System (published 1st time on github at 2020-08-10)☆49Mar 10, 2026Updated 2 weeks ago
- Web service for implementing a large-scale translation memory☆92Jun 14, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, …☆24Apr 8, 2022Updated 3 years ago
- Terminology EXtraction and Text Analytics (TEXTA) Toolkit☆35Nov 17, 2022Updated 3 years ago
- The FLORES+ Machine Translation Benchmark☆111Nov 12, 2024Updated last year
- A Neural Framework for MT Evaluation☆730Mar 5, 2026Updated 3 weeks ago
- Improved Sentence Alignment in Linear Time and Space☆192Mar 6, 2023Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆35Jun 29, 2025Updated 8 months ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆48Feb 19, 2025Updated last year
- GEMBA — GPT Estimation Metric Based Assessment☆146Dec 15, 2025Updated 3 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆127Oct 13, 2025Updated 5 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆59Jul 9, 2021Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated last month
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆13Nov 23, 2021Updated 4 years ago
- ☆28Aug 25, 2025Updated 7 months ago
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago