browsermt / marian-dev
Fast Neural Machine Translation in C++ - development repository
★22, updated last year
Alternatives and similar repositories for marian-dev
Users who are interested in marian-dev are comparing it to the libraries listed below.
- Efficient teacher-student models and scripts to make them (★53, updated 2 years ago)
- Toolkit for training/converting LibreTranslate compatible language models (★73, updated 6 months ago)
- Translation demonstrator (★35, updated 5 years ago)
- A sentence segmentation library with wide language support optimized for speed and utility. (★77, updated 3 weeks ago)
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target. (★50, updated 8 months ago)
- Training scripts for Argos Translate (★152, updated 3 weeks ago)
- The Open Parallel Corpus (★80, updated 2 weeks ago)
- Fast Neural Machine Translation in C++ - development repository (★283, updated 5 months ago)
- Seed Machine Translation Data (★33, updated last year)
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. (★55, updated 3 months ago)
- fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified (zh-hans) and Traditional Chinese (zh-ha… (★43, updated 3 years ago)
- ★92, updated 3 years ago
- GGML implementation of BERT model with Python bindings and quantization. (★58, updated last year)
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML (★65, updated 11 months ago)
- Bitextor generates translation memories from multilingual websites (★299, updated last year)
- Extracts plain text, language identification and more metadata from WARC records (★23, updated 3 months ago)
- Targeted language identifier, based on FastText and Hunspell. (★38, updated 4 months ago)
- fasttext with wheels and no external dependency, but only the predict method (<1MB) (★18, updated last year)
- With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this… (★21, updated 5 months ago)
- Translate HTML using Argos Translate (★54, updated 2 years ago)
- Open information and community for machine translation (★80, updated last month)
- Bard Storyteller ebook reader with speech synthesis support (★65, updated last year)
- Faster, modernized fork of the language identification tool langid.py (★61, updated last year)
- A Python module for word inflections designed for use with spaCy. (★93, updated 5 years ago)
- A database of number names for 186 languages, locales, and scripts (★67, updated 2 years ago)
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. (★86, updated 4 years ago)
- Fast SymSpell written in C++ and exposed to Python via pybind11 (★44, updated 10 months ago)
- CorrectLy - Open Source Spelling & Grammar correction (★43, updated 3 years ago)
- Documentation effort for the BookCorpus dataset (★34, updated 4 years ago)
- Use an appropriate mix of LLMs based on https://nuenki.app/blog research to translate languages better than any one tool. (★28, updated 6 months ago)