epk2112 / fairseq_meta_mms_Google_Colab_implementation
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for fairseq_meta_mms_Google_Colab_implementation
- ☆77Updated 4 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- A lightweight end-to-end text-to-speech model☆91Updated last month
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- ☆252Updated 7 months ago
- ☆152Updated last year
- ☆54Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- Python bindings for whisper.cpp☆216Updated 5 months ago
- ☆171Updated 11 months ago
- ☆223Updated 11 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆135Updated 3 months ago
- Runpod WhisperX Docker Container Repo☆11Updated 8 months ago
- Speech Diarization for scrum automation☆97Updated last year
- Site for sharing Bark voices☆48Updated 4 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- ☆87Updated 6 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆67Updated 6 months ago
- Gradio UI for a Cog API☆64Updated 7 months ago
- On-device streaming text-to-speech engine powered by deep learning☆54Updated last week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆204Updated 5 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆226Updated 4 months ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆27Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆70Updated 3 weeks ago
- ☆40Updated 7 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated 2 weeks ago
- ☆34Updated 6 months ago
- Tools for making LJSpeech datasets☆21Updated 9 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆253Updated 2 months ago