Helsinki-NLP / subalignLinks
☆16Updated last year
Alternatives and similar repositories for subalign
Users that are interested in subalign are comparing it to the libraries listed below
Sorting:
- Multilingual sentence alignment using sentence embeddings☆121Updated 9 months ago
- ☆30Updated last year
- Improved Sentence Alignment in Linear Time and Space☆178Updated 2 years ago
- Sentence aligner☆116Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated this week
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆219Updated last year
- Bilingual sengence aligner☆28Updated last year
- Bilingual term extractor☆56Updated last year
- ☆11Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- ☆74Updated 4 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆36Updated 2 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆31Updated 6 months ago
- Transformer based translation quality estimation☆112Updated 2 years ago
- ☆12Updated 9 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆250Updated 2 years ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆193Updated last year
- Machine-Translation-based sentence alignment tool for parallel text☆311Updated 4 years ago
- Translation demonstrator☆34Updated 5 years ago
- Efficient Low-Memory Aligner☆146Updated 6 months ago
- Bicleaner fork that uses neural networks☆40Updated last month
- ☆42Updated 7 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆18Updated 2 years ago
- ☆49Updated last year
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆14Updated 11 months ago
- Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)☆35Updated 8 months ago