epk2112 / fairseq_meta_mms_Google_Colab_implementation
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐
โ18Updated last year
Related projects โ
Alternatives and complementary repositories for fairseq_meta_mms_Google_Colab_implementation
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectโ52Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationโ84Updated last month
- โ77Updated 4 months ago
- โ152Updated last year
- A lightweight end-to-end text-to-speech modelโ91Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ84Updated 6 months ago
- โ73Updated last month
- Efficient approach to speaker diarization using voice characteristics extractionโ68Updated 6 months ago
- Repository contains code to fine-tune WhisperASR modelโ23Updated last year
- ๐ ๐ค Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningโ138Updated 4 months ago
- TTS with The Massively Multilingual Speech (MMS) projectโ226Updated 4 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteโ169Updated 2 months ago
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ40Updated last year
- audiolm-pytorch training codeโ15Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionโ262Updated 2 months ago
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper modelโ15Updated last year
- โ171Updated 11 months ago
- Speech Diarization for scrum automationโ97Updated last year
- VALL-E 2 reproductionโ87Updated 4 months ago
- VoiceBox neural network implementationโ96Updated 3 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sโฆโ27Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023โ47Updated last year
- โ61Updated 3 months ago
- Collection of Open Source Speech Dataโ146Updated 2 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.โ133Updated last year
- โ257Updated 5 months ago
- โ254Updated 8 months ago
- Faster Tortoise inference then Tortoise Fast Forkโ122Updated 7 months ago
- whisper.cpp bindings for pythonโ77Updated last year