mozilla / translation-service
This is the repo that hosts the code for Mozilla's translation service
☆21 Updated 9 months ago
Related projects
Alternatives and complementary repositories for translation-service
- INACTIVE - Bergamot translator ☆73 Updated 10 months ago
- The code, training pipeline, and models that power Firefox Translations ☆154 Updated this week
- Efficient teacher-student models and scripts to make them ☆48 Updated 10 months ago
- CPU-optimized Neural Machine Translation models for Firefox Translations ☆173 Updated 3 weeks ago
- Translations website utilizing Bergamot proceedings ☆61 Updated 3 weeks ago
- Bicleaner fork that uses neural networks ☆38 Updated 3 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆48 Updated 2 months ago
- Fast Neural Machine Translation in C++ - development repository ☆20 Updated 6 months ago
- Cross-platform C++ library focusing on optimized machine translation on consumer-grade devices. ☆340 Updated 6 months ago
- Fast Neural Machine Translation in C++ - development repository ☆257 Updated 3 weeks ago
- Customizable machine translation in C++ ☆42 Updated 7 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆67 Updated 6 months ago
- Metadata and versioning details for the Common Voice dataset ☆141 Updated last month
- Linguistic processing for Common Voice ☆51 Updated 9 months ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂 ☆48 Updated 2 weeks ago
- Mozilla Voice Community Playbook ☆43 Updated 5 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation. ☆102 Updated this week
- Softcatalà neural translation models ☆18 Updated this week
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆150 Updated 4 months ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆25 Updated 2 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆150 Updated 3 months ago
- The Open Parallel Corpus ☆57 Updated this week
- ☆51 Updated last year
- Command line tool to create corpora for Common Voice ☆75 Updated 5 months ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with PyTorch. ☆13 Updated 4 years ago
- Targeted language identifier, based on FastText and Hunspell. ☆29 Updated 3 weeks ago
- Natural Language Inflection in English ☆11 Updated 2 years ago
- Bitextor generates translation memories from multilingual websites ☆290 Updated this week
- A guide to building language technology in new languages. ☆57 Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆73 Updated last year