mozilla / translation-service
This is the repo that hosts the code for Mozilla's translation service.
☆28 · Updated last year
Alternatives and similar repositories for translation-service
Users who are interested in translation-service are comparing it to the libraries listed below.
- The code, training pipeline, and models that power Firefox Translations ☆204 · Updated this week
- Fast Neural Machine Translation in C++ - development repository ☆276 · Updated last month
- Softcatalà neural translation models ☆18 · Updated 7 months ago
- Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language ☆45 · Updated 4 months ago
- A sentence segmentation library with wide language support optimized for speed and utility. ☆66 · Updated 2 months ago
- Bitextor generates translation memories from multilingual websites ☆295 · Updated 9 months ago
- Polish morphological tagger. ☆43 · Updated 2 years ago
- Mycroft's multilingual text parsing and formatting library ☆77 · Updated 2 years ago
- Command line tool to create corpora for Common Voice ☆78 · Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆219 · Updated last year
- Pure C natural language identifier with support for 97 languages ☆26 · Updated 7 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation. ☆108 · Updated 3 months ago
- Open neural machine translation models and web services ☆719 · Updated 2 months ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections. ☆347 · Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆51 · Updated last month
- Fast Neural Machine Translation in C++ - development repository ☆21 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆324 · Updated 9 months ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂 ☆63 · Updated 2 months ago
- Faster, modernized fork of the language identification tool langid.py ☆56 · Updated 9 months ago
- Tool for creation, manipulation and maintenance of voice corpora ☆82 · Updated last year
- Tooling for producing French dataset for Common Voice ☆101 · Updated 7 months ago
- Open language modeling toolkit based on PyTorch ☆143 · Updated this week
- Efficient teacher-student models and scripts to make them ☆51 · Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆74 · Updated 5 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆128 · Updated 9 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆158 · Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆171 · Updated 2 months ago
- Scraping Wikipedia for fair use sentences ☆54 · Updated last year
- Metadata and versioning details for the Common Voice dataset ☆152 · Updated 2 months ago
- Training scripts for Argos Translate ☆139 · Updated this week