machinetranslate / machinetranslate.org
Open information and community for machine translation
☆71Updated this week
Related projects
Alternatives and complementary repositories for machinetranslate.org
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 2 months ago
- Bicleaner fork that uses neural networks☆38Updated 3 months ago
- Bilingual term extractor☆52Updated 10 months ago
- ☆190Updated 5 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- Curriculum training☆16Updated last month
- Translation Memory Open-source Purifier☆33Updated 2 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆20Updated 2 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆18Updated last year
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 3 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 4 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆48Updated 2 months ago
- ☆67Updated 3 months ago
- Open language modeling toolkit based on PyTorch☆59Updated last week
- ☆43Updated 3 months ago
- Efficient Low-Memory Aligner☆137Updated 2 months ago
- Creating super-parallel corpora of 1500+ unique languages for NLP research☆32Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated 10 months ago
- A tool for calculating character n-gram F-score☆66Updated last year
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆72Updated last year
- This is a neural spell checker☆60Updated last year
- Multilingual and monolingual corpora focused on Caucasus languages for Natural Language Processing (NLP)☆31Updated last week
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- ☆13Updated 2 years ago
- Bilingual sentence similarity classifier using Tensorflow☆19Updated 5 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆66Updated last year
- ☆16Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆66Updated 6 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆36Updated last year