epk2112 / fairseq_meta_mms_Google_Colab_implementationLinks
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐
โ19Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
Sorting:
- โ83Updated last year
- TTS with The Massively Multilingual Speech (MMS) projectโ235Updated last year
- A lightweight end-to-end text-to-speech modelโ126Updated 11 months ago
- Faster Tortoise inference then Tortoise Fast Forkโ127Updated last year
- โ259Updated last year
- โ233Updated 2 years ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ258Updated last year
- ๐ ๐ค Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningโ161Updated last year
- Fine Tune the Style-TTS2 Voice Modelโ266Updated 7 months ago
- โ175Updated 2 years ago
- โ157Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ100Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcriptionโ160Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionโ106Updated 7 months ago
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ43Updated 2 years ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteโ232Updated 11 months ago
- โ193Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.โ74Updated 6 months ago
- TorToiSe fine-tuning with DLASโ226Updated last year
- create dataset from list of youtube links easilyโ22Updated 2 years ago
- Site for sharing Bark voicesโ51Updated 10 months ago
- โ275Updated last year
- โ388Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning โฆโ27Updated 2 years ago
- โ100Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectโ54Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sโฆโ28Updated 2 years ago
- Open source inference code for Rev's modelโ435Updated 9 months ago
- Repository contains code to fine-tune WhisperASR modelโ23Updated 3 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.โ128Updated 6 months ago