edupoux / MVA_2022_SL
☆7Updated 2 years ago
Alternatives and similar repositories for MVA_2022_SL
Users that are interested in MVA_2022_SL are comparing it to the libraries listed below
Sorting:
- ☆100Updated 2 weeks ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆277Updated 3 months ago
- ☆353Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆335Updated last year
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆92Updated 10 months ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆445Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆132Updated 5 months ago
- ☆287Updated 11 months ago
- Various transformers for FSDP research☆37Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆151Updated last year
- Code for Zero-Shot Tokenizer Transfer☆127Updated 4 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆80Updated 8 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆105Updated last month
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- HF's ML for Audio study group☆192Updated 2 years ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆131Updated 5 months ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆39Updated 2 months ago
- MAFAND-MT☆55Updated 10 months ago
- Bicleaner fork that uses neural networks☆40Updated this week
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated last year
- Library for pruning experts per language pair in NLLB-200☆33Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- The FLORES+ Machine Translation Benchmark☆102Updated 6 months ago
- The pipeline for the OSCAR corpus☆167Updated last year
- A repository for log-time feedforward networks☆222Updated last year
- Scalable and Performant Data Loading☆259Updated this week