epk2112 / fairseq_meta_mms_Google_Colab_implementationLinks
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)π
β18Updated 2 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below
Sorting:
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectβ52Updated last year
- TTS with The Massively Multilingual Speech (MMS) projectβ233Updated 11 months ago
- β83Updated 11 months ago
- β258Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionβ97Updated last week
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)β20Updated 5 years ago
- A lightweight end-to-end text-to-speech modelβ114Updated 4 months ago
- Deep learning based speech and pronunciation assessment API for 8 languages.β43Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ95Updated last year
- β174Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ152Updated last year
- Whisper from OpenAi and diarization with Pyannoteβ45Updated last year
- β151Updated 6 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXβ27Updated 8 months ago
- β235Updated last week
- ONNX-compatible Fast SeamlessM4TβMassively Multilingual & Multimodal Machine Translationβ43Updated last year
- web based editor for subtitles and transcriptsβ135Updated 10 months ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated 11 months ago
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β104Updated last month
- β231Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR modelsβ¦β18Updated 2 years ago
- Speaker Diarization with Transformersβ68Updated 2 weeks ago
- β158Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave inputβ30Updated 9 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β28Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 2 years ago
- A list of scripts/notebooks I'd like to keep handyβ17Updated 10 months ago
- Site for sharing Bark voicesβ51Updated 3 months ago