KevKibe / African-Whisper
Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to prod.
☆36 Updated 11 months ago
Alternatives and similar repositories for African-Whisper
Users who are interested in African-Whisper are comparing it to the libraries listed below.
- ☆127 Updated 10 months ago
- ☆321 Updated last year
- ☆157 Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆56 Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools ☆189 Updated last year
- ☆63 Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆70 Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆106 Updated 7 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆71 Updated 7 months ago
- HF's ML for Audio study group ☆200 Updated 2 years ago
- A Python package for the Whisper normalizer ☆74 Updated 3 months ago
- Open TTS models, built for streaming on the edge ☆45 Updated 10 months ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆102 Updated 5 months ago
- Create a dataset from a list of YouTube links easily ☆22 Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use cases, such as a simple voice bot and live (real-time) transcription ☆104 Updated 5 months ago
- Speaker Diarization with Transformers ☆69 Updated 7 months ago
- ⚡️ Framework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-level… ☆13 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 Updated 2 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face. ☆357 Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project ☆235 Updated last year
- ☆386 Updated last year
- Repository contains code to fine-tune WhisperASR model ☆23 Updated 3 years ago
- ☆192 Updated last year
- ☆245 Updated last month
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆150 Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 Updated last year
- Audio search using Azure Cognitive Search ☆26 Updated 2 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (PyPI: https://pypi.org/project/indictranstoolkit) ☆37 Updated 6 months ago
- A list of scripts/notebooks I'd like to keep handy ☆18 Updated last year
- A TTS model that makes a speaker speak new languages ☆76 Updated last year