egorsmkv / NLLB-Translator
☆16Updated 2 years ago
Alternatives and similar repositories for NLLB-Translator:
Users that are interested in NLLB-Translator are comparing it to the libraries listed below
- Transformation spoken text to written text☆30Updated 10 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆270Updated 2 months ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆25Updated 2 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆97Updated 3 years ago
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆221Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated last year
- ☆57Updated 2 years ago
- Meta's "No Language Left Behind" models served as web app and REST API☆206Updated 7 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆104Updated this week
- Triton backend for https://github.com/OpenNMT/CTranslate2☆34Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆119Updated 7 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆50Updated 2 months ago
- 80x faster and 95% accurate language identification with Fasttext☆150Updated last year
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆161Updated 7 months ago
- Multilingual sentence alignment using sentence embeddings☆113Updated 4 months ago
- ☆55Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆138Updated 11 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- ☆242Updated 9 months ago
- Library for pruning experts per language pair in NLLB-200☆32Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆70Updated 11 months ago
- A small seq2seq punctuator tool based on DistilBERT☆50Updated 3 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- Multilingual Generative Pretrained Model☆206Updated 10 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆122Updated 3 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆94Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆209Updated 4 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆20Updated 8 months ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆43Updated 2 years ago