KevKibe / African-Whisper
🚀 Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
☆36 Updated 10 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
- ☆319 Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆70 Updated 7 months ago
- ☆49 Updated 2 years ago
- HF's ML for Audio study group ☆199 Updated 2 years ago
- ☆158 Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆55 Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆69 Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆105 Updated 6 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆188 Updated last year
- ☆185 Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆356 Updated 2 years ago
- ☆127 Updated 9 months ago
- A python package for whisper normalizer ☆74 Updated 3 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆37 Updated 2 years ago
- ☆385 Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆151 Updated last year
- ☆275 Updated last year
- ☆62 Updated 6 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆103 Updated 4 months ago
- TTS with The Massively Multilingual Speech (MMS) project ☆233 Updated last year
- ☆206 Updated last year
- Tunable pipelines ☆41 Updated 4 months ago
- Real time web based Speech-to-Text app with Streamlit ☆254 Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 Updated last year
- Fine-tune the LLM part of the Spark-TTS model ☆119 Updated 9 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆86 Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 Updated last year
- Open TTS models, built for streaming on the edge ☆44 Updated 9 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit) ☆37 Updated 5 months ago
- ☆261 Updated last year