sanchit-gandhi / seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
★35, updated 2 years ago
Alternatives and similar repositories for seq2seq-speech:
Users who are interested in seq2seq-speech are comparing it to the repositories listed below.
- Audio tokenization, in the fastest way possible! (★50, updated 7 months ago)
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. (★27, updated last year)
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. (★12, updated 2 years ago)
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… (★26, updated 11 months ago)
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages (★13, updated 2 years ago)
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… (★34, updated last year)
- asr2k (★49, updated 10 months ago)
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. (★13, updated last year)
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. (★17, updated 4 months ago)
- (★56, updated 2 years ago)
- A collection of utilities for handling IPA phones. (★25, updated last year)
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️ (★36, updated 2 years ago)
- Speech in Flax/JAX (★15, updated 2 years ago)
- (★15, updated 2 years ago)
- (★84, updated last year)
- GPT for FACodec (★13, updated last year)
- My explorations into editing the knowledge and memories of an attention network (★34, updated 2 years ago)
- (★20, updated 2 years ago)
- (★86, updated this week)
- Transcribing Speech with Multinomial Diffusion, training code and models. (★76, updated last year)
- HomebrewNLP in JAX flavour for maintainable TPU-Training (★49, updated last year)
- The demo page of UniAudio (★33, updated last year)
- Execute arbitrary SQL queries on 🤗 Datasets (★32, updated last year)
- Torch implementation of Whisper-guided DDPM based Voice Conversion (★49, updated 2 years ago)
- (★62, updated 8 months ago)
- (★64, updated 7 months ago)
- Experiments with generating opensource language model assistants (★97, updated last year)
- A JAX library for building lattice-based speech transducer models (★45, updated 3 months ago)
- (★34, updated 3 years ago)
- Collection of scripts from mHuBERT-147. (★24, updated 4 months ago)