epk2112 / fairseq_meta_mms_Google_Colab_implementationLinks
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇
☆18Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
Sorting:
- ☆83Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆230Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- ☆232Updated last year
- Site for sharing Bark voices☆51Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- A lightweight end-to-end text-to-speech model☆123Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year
- ☆262Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 4 months ago
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Fine Tune the Style-TTS2 Voice Model☆256Updated 4 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆101Updated 2 months ago
- Python bindings for whisper.cpp☆246Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆70Updated 3 months ago
- Meta's "No Language Left Behind" models served as web app and REST API☆241Updated 5 months ago
- Tools for making LJSpeech datasets☆25Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆254Updated last year
- ☆172Updated 10 months ago
- ☆174Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- openvino version of openai/whisper☆176Updated 2 years ago
- ☆36Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- Audio datasets, easier.☆85Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Open source inference code for Rev's model☆433Updated 6 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆347Updated 2 years ago