epk2112 / fairseq_meta_mms_Google_Colab_implementation
This code shows how to transcribe audio to text using fairseq_meta_mms (Google Colab version).
☆18 · Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users who are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below.
- TTS with The Massively Multilingual Speech (MMS) project ☆234 · Updated last year
- ☆83 · Updated last year
- A lightweight end-to-end text-to-speech model ☆120 · Updated 7 months ago
- ☆262 · Updated last year
- ☆232 · Updated last year
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ☆252 · Updated last year
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆157 · Updated last year
- ☆175 · Updated last year
- ☆169 · Updated 9 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆53 · Updated 2 years ago
- Site for sharing Bark voices ☆51 · Updated 6 months ago
- TorToiSe fine-tuning with DLAS ☆224 · Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated last year
- ☆274 · Updated last year
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆127 · Updated 2 years ago
- Fine-tune the Style-TTS2 voice model ☆252 · Updated 3 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆156 · Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆123 · Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 · Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆69 · Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆185 · Updated 5 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆100 · Updated 3 months ago
- Meta's "No Language Left Behind" models served as web app and REST API ☆238 · Updated 4 months ago
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 · Updated 2 years ago
- Collection of Open Source Speech Data ☆160 · Updated last week
- ☆36 · Updated 2 years ago
- Python bindings for whisper.cpp ☆246 · Updated last year
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆218 · Updated 10 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆164 · Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆342 · Updated 10 months ago