KevKibe / African-Whisper
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
☆35 Updated 9 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
- ☆318 Updated last year
- Finetune VITS and MMS using HuggingFace's tools ☆177 Updated last year
- ☆158 Updated 2 years ago
- ☆127 Updated 8 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆55 Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆105 Updated 5 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆65 Updated 5 months ago
- ☆12 Updated 7 months ago
- ☆177 Updated 11 months ago
- Open TTS models, built for streaming on the edge ☆44 Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 Updated last month
- create dataset from list of youtube links easily ☆21 Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆103 Updated 3 months ago
- ☆261 Updated last year
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆99 Updated 3 months ago
- ☆49 Updated 2 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆151 Updated last year
- Audio search using Azure Cognitive Search ☆25 Updated last week
- ☆378 Updated last year
- TTS with The Massively Multilingual Speech (MMS) project ☆231 Updated last year
- A streaming whisper server for on-prem transcription ☆22 Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit) ☆37 Updated 4 months ago
- Joint speech-language model - respond directly to audio! ☆372 Updated last year
- Speaker Diarization with Transformers ☆69 Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 Updated last year
- Text-to-Speech for languages of India ☆298 Updated last year
- NVIDIA Riva runnable tutorials ☆157 Updated last week
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆352 Updated 2 years ago
- SoTA open-source TTS ☆114 Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆215 Updated 7 months ago