KevKibe / African-WhisperLinks
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
☆36Updated 9 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
Sorting:
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆68Updated 6 months ago
- ☆318Updated last year
- ☆158Updated 2 years ago
- ☆127Updated 9 months ago
- ☆183Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆182Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆55Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 6 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- HF's ML for Audio study group☆199Updated 2 years ago
- ☆382Updated last year
- A python package for whisper normalizer☆70Updated 2 months ago
- ☆261Updated last year
- ☆49Updated 2 years ago
- create dataset from list of youtube links easily☆21Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆353Updated 2 years ago
- A list of scripts/notebooks I'd like to keep handy☆18Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆86Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆193Updated 7 months ago
- ☆60Updated 5 months ago
- Collection of Open Source Speech Data☆163Updated 2 months ago
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆176Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated last month
- A streaming whisper server for on-prem transcription☆22Updated last year
- Open TTS models, built for streaming on the edge☆44Updated 9 months ago
- ☆275Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- ☆12Updated 7 months ago
- VoiceBox neural network implementation☆110Updated last year