epk2112 / fairseq_meta_mms_Google_Colab_implementationLinks
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐
โ18Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
Sorting:
- โ83Updated 11 months ago
- A lightweight end-to-end text-to-speech modelโ115Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extractionโ94Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ94Updated last year
- ๐ Text-prompted Generative Audio Model - With the ability to clone voicesโ20Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationโ25Updated 2 years ago
- โ144Updated 5 months ago
- TTS with The Massively Multilingual Speech (MMS) projectโ230Updated 10 months ago
- โ257Updated last year
- Code for OpenAI Whisper Web App Demoโ93Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationโ144Updated last year
- Faster Tortoise inference then Tortoise Fast Forkโ126Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-aiโ128Updated 2 years ago
- ASR + diarization model server with speculative decodingโ60Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)โ81Updated last year
- Speech Diarization for scrum automationโ105Updated last year
- Site for sharing Bark voicesโ51Updated 2 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectโ52Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023โ54Updated 2 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.โ91Updated 2 weeks ago
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ43Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXโ27Updated 7 months ago
- Google's SoundStorm: Efficient Parallel Audio Generationโ132Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationโ170Updated last month
- โ229Updated 2 months ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ245Updated 11 months ago
- Finetune VITS and MMS using HuggingFace's toolsโ154Updated last year
- Runpod WhisperX Docker Container Repoโ14Updated last year
- TorToiSe fine-tuning with DLASโ220Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.โ62Updated last week