loomchild / maligna
Bilingual sengence aligner
☆27Updated last year
Alternatives and similar repositories for maligna:
Users that are interested in maligna are comparing it to the libraries listed below
- Data collection, alignment and TAUS repository☆23Updated 7 years ago
- Sentence aligner☆112Updated 3 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 10 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆14Updated 7 months ago
- ☆21Updated 5 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- ☆12Updated 9 years ago
- Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".☆22Updated 5 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆20Updated 8 months ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆10Updated last week
- Translation Error Rate (TER)☆43Updated 6 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- ParCourE - Parallel Corpus Explorer☆12Updated 3 years ago
- Gale-Church sentence aligner with options for variable parameters☆17Updated 5 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Program used to split text into segments☆25Updated 5 months ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆37Updated 2 years ago
- Translation Memory Open-source Purifier☆34Updated 2 years ago
- List of corpora annotated for coreference for different languages☆17Updated 7 months ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- ☆17Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated last month
- Offline extractor of synchronous context-free grammars for machine translation.☆31Updated 9 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Updated 10 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Updated 3 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 4 years ago