browsermt / marian-dev
Fast Neural Machine Translation in C++ - development repository
☆20Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for marian-dev
- Efficient teacher-student models and scripts to make them☆48Updated 11 months ago
- A database of number names for 186 languages, locales, and scripts☆66Updated last year
- Translation demonstrator☆27Updated 4 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆45Updated 6 months ago
- Bilingual sentence similarity classifier using Tensorflow☆19Updated 5 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆10Updated 2 months ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated 9 months ago
- Thot toolkit for statistical machine translation☆50Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py☆48Updated 5 months ago
- Fast approximate strings search & spelling correction☆57Updated 3 years ago
- Transform TMX to text☆29Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 3 months ago
- Website and documentation☆17Updated 3 weeks ago
- PurePos is an open source hybrid morphological tagger.☆15Updated 4 years ago
- arXiv plain text extraction☆42Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node☆16Updated 2 years ago
- Python package to compute metrics on an NLU intent parsing pipeline☆13Updated 4 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated last year
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆48Updated 2 months ago
- Fast stand-alone C++ decoder for RNN-based NMT models☆25Updated 3 years ago
- Corpus preprocessing☆95Updated 8 months ago
- Source code for the Apple reproduction☆31Updated 3 years ago
- zero-vocab or low-vocab embeddings☆17Updated 2 years ago
- NLP command-line assistant powered by OpenAI☆21Updated 9 months ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆30Updated 7 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- ☆67Updated 3 months ago