KevKibe / African-Whisper
🚀 Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
☆28 Updated 6 months ago
Alternatives and similar repositories for African-Whisper
Users who are interested in African-Whisper compare it to the libraries listed below.
- ☆127 Updated 5 months ago
- ☆157 Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools ☆162 Updated last year
- ☆307 Updated last year
- Repository containing code to fine-tune the Whisper ASR model ☆23 Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆67 Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction ☆99 Updated 2 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face. ☆332 Updated 2 years ago
- Fine-tune a Bangla ASR model trained on the Bangla Mozilla Common Voice dataset ☆11 Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (PyPI: https://pypi.org/project/indictranstoolkit) ☆35 Updated last month
- Speaker Diarization with Transformers ☆69 Updated 2 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆54 Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project ☆235 Updated last year
- HF's ML for Audio study group ☆195 Updated 2 years ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆109 Updated 10 months ago
- ☆377 Updated 11 months ago
- Easily create a dataset from a list of YouTube links ☆21 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆371 Updated last year
- Collection of Open Source Speech Data ☆159 Updated 9 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆61 Updated 2 months ago
- ☆167 Updated 8 months ago
- ☆262 Updated last year
- ☆206 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- Text-to-Speech for languages of India ☆267 Updated 9 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Open TTS models, built for streaming on the edge ☆43 Updated 5 months ago
- A streaming whisper server for on-prem transcription ☆21 Updated last year
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆93 Updated 2 weeks ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆148 Updated last year