KevKibe / African-Whisper
Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
☆27 Updated 5 months ago
Alternatives and similar repositories for African-Whisper
Users who are interested in African-Whisper are comparing it to the libraries listed below.
- ☆127 Updated 4 months ago
- ☆158 Updated 2 years ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆107 Updated 10 months ago
- ☆308 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆98 Updated last month
- ☆205 Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆54 Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆324 Updated 2 years ago
- Video+code lecture on building nanoGPT from scratch ☆69 Updated last year
- Joint speech-language model - respond directly to audio! ☆370 Updated last year
- A streaming whisper server for on-prem transcription ☆20 Updated 11 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆90 Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆60 Updated 2 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit) ☆34 Updated 2 weeks ago
- TTS with The Massively Multilingual Speech (MMS) project ☆234 Updated last year
- Speaker Diarization with Transformers ☆69 Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆65 Updated 2 weeks ago
- ⚡️ Framework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-level… ☆13 Updated 7 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆161 Updated last year
- ☆124 Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video. ☆52 Updated last week
- ☆49 Updated 2 years ago
- ☆35 Updated 4 years ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 9 months ago
- Text-to-Speech for languages of India ☆261 Updated 9 months ago
- an optimized, production-ready implementation of active speaker detection ☆67 Updated last year
- Open TTS models, built for streaming on the edge ☆43 Updated 4 months ago
- Repository contains code to fine-tune WhisperASR model ☆23 Updated 2 years ago
- NVIDIA Riva runnable tutorials ☆141 Updated last week