Helsinki-NLP / subalign
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for subalign
- Multilingual sentence alignment using sentence embeddings☆97Updated this week
- Bilingual sentence similarity classifier using Tensorflow☆19Updated 5 years ago
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- Sentence aligner☆108Updated 3 years ago
- Translation demonstrator☆27Updated 4 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆12Updated 2 months ago
- Efficient Low-Memory Aligner☆137Updated 2 months ago
- ☆11Updated 2 years ago
- Morfessor EM+Prune☆10Updated 4 years ago
- ☆12Updated 8 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆20Updated 2 years ago
- ☆67Updated 3 months ago
- Bilingual term extractor☆52Updated 10 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 2 months ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 2 months ago
- Transform TMX to text☆29Updated last year
- ☆22Updated 11 months ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated last year
- ☆42Updated 6 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 4 years ago
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- A tiny BERT for low-resource monolingual models☆29Updated last month
- List of corpora annotated for coreference for different languages☆17Updated 3 months ago
- Automatic extraction of edited sentences from text edition histories.☆81Updated 2 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Scripts and tools for doing unsupervised acceptability prediction.☆15Updated last year
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago