epk2112 / fairseq_meta_mms_Google_Colab_implementation
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐
โ18Updated last year
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation:
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
- โ80Updated 7 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectโ52Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ92Updated 9 months ago
- โ200Updated 4 months ago
- FastAPI service on top of WhisperXโ68Updated 3 weeks ago
- Repository contains code to fine-tune WhisperASR modelโ23Updated 2 years ago
- This project presents a comprehensive study on video dubbing techniques and the development of a specialized video dubbing system.โ9Updated last year
- โ154Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionโ88Updated 9 months ago
- Tools for making LJSpeech datasetsโ24Updated last year
- โ254Updated 11 months ago
- TorToiSe fine-tuning with DLASโ218Updated 6 months ago
- TTS with The Massively Multilingual Speech (MMS) projectโ226Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.โ57Updated last week
- โ117Updated 2 months ago
- โ346Updated 5 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiโ129Updated last year
- ๐ Text-prompted Generative Audio Model - With the ability to clone voicesโ20Updated last year
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ42Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationโ119Updated 2 weeks ago
- ๐ง | RunPod worker of the faster-whisper model for Serverless Endpoint.โ84Updated 2 weeks ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ235Updated 8 months ago
- Faster Tortoise inference then Tortoise Fast Forkโ128Updated 10 months ago
- Python bindings for whisper.cppโ221Updated this week
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.โ286Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.โ135Updated last year
- โ94Updated 9 months ago
- Runpod WhisperX Docker Container Repoโ13Updated 11 months ago
- Audio datasets, easier.โ82Updated last year
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper modelโ15Updated 2 years ago