pluiez / NLLB-inference
☆57 Updated 3 years ago
Alternatives and similar repositories for NLLB-inference
Users who are interested in NLLB-inference are comparing it to the libraries listed below.
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆294 Updated last week
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆50 Updated 2 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆57 Updated 3 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆74 Updated 9 months ago
- NTREX -- News Test References for MT Evaluation ☆87 Updated last year
- Library for pruning experts per language pair in NLLB-200 ☆34 Updated 2 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆236 Updated last year
- MAFAND-MT ☆60 Updated last year
- 📝 An easy-to-use package to restore punctuation of the text. ☆119 Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit ☆115 Updated this week
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 Updated 2 years ago
- This is a neural spelling checker ☆69 Updated 3 years ago
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages ☆229 Updated 2 years ago
- Bicleaner fork that uses neural networks ☆40 Updated last week
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages ☆227 Updated last year
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc… ☆58 Updated last year
- A small seq2seq punctuator tool based on DistilBERT ☆53 Updated last year
- A tool that locates, downloads, and extracts machine translation corpora ☆162 Updated 4 months ago
- ☆34 Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 Updated 3 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆28 Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆87 Updated last year
- Curriculum training ☆22 Updated 6 months ago
- Experiments for XLM-V Transformers Integration ☆13 Updated 2 years ago
- ☆127 Updated this week
- The FLORES+ Machine Translation Benchmark ☆109 Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆99 Updated 2 years ago
- Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023) ☆135 Updated last year
- Fast whitespace correction with Transformers ☆17 Updated 5 months ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP ☆14 Updated 4 years ago