KevKibe / African-Whisper
Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
☆27 Updated 4 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
- Open TTS models, built for streaming on the edge ☆43 Updated 4 months ago
- ☆158 Updated 2 years ago
- create dataset from list of youtube links easily ☆21 Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆63 Updated last month
- ☆128 Updated 3 months ago
- A lightweight Python library for running TTS models with a unified API. ☆20 Updated 5 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization ☆25 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- ☆49 Updated 2 years ago
- Speaker diarization service ☆23 Updated 3 weeks ago
- A python package for whisper normalizer ☆63 Updated last month
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆54 Updated 2 years ago
- Open-source and reproducible benchmarks for Speaker Diarization ☆29 Updated this week
- Speaker Diarization with Transformers ☆67 Updated last month
- Dippy Synthetic Speech Subnet ☆16 Updated last month
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆36 Updated 2 years ago
- ☆85 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆20 Updated 4 months ago
- ☆62 Updated 11 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆59 Updated last month
- ☆16 Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- Audio tokenization, in the fastest way possible! ☆52 Updated 10 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆17 Updated 8 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated last year
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- Tunable pipelines ☆34 Updated 4 months ago