epk2112 / fairseq_meta_mms_Google_Colab_implementationLinks
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐
โ18Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
Sorting:
- TTS with The Massively Multilingual Speech (MMS) projectโ232Updated last year
- โ83Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)โ101Updated 2 months ago
- ๐ฌ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.โ217Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ96Updated last year
- A lightweight end-to-end text-to-speech modelโ121Updated 7 months ago
- Faster Tortoise inference then Tortoise Fast Forkโ127Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.โ126Updated 2 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning โฆโ26Updated 2 years ago
- ๐ ๐ค Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningโ157Updated last year
- Site for sharing Bark voicesโ51Updated 6 months ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ251Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectโ53Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationโ25Updated 2 years ago
- โ232Updated last year
- A testing repo to share code and thoughts on diarisationโ56Updated last year
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ42Updated 2 years ago
- โ157Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extractionโ102Updated 4 months ago
- โ261Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXโ28Updated last year
- โ171Updated 10 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.โ68Updated 3 months ago
- SoTA open-source TTSโ99Updated 4 months ago
- ๐ผ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decompositionโ15Updated last year
- Fine Tune the Style-TTS2 Voice Modelโ254Updated 4 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.โ119Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.โ119Updated 2 years ago
- โ174Updated last year
- A curated list of awesome OpenAI's Whisperโ98Updated 2 years ago