KevKibe / African-Whisper
π Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
β27Updated 2 months ago
Alternatives and similar repositories for African-Whisper:
Users that are interested in African-Whisper are comparing it to the libraries listed below
- Open TTS models, built for streaming on the edgeβ39Updated last month
- create dataset from list of youtube links easilyβ17Updated 2 years ago
- β126Updated last month
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASRβ50Updated 10 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPUβ32Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β53Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β62Updated 2 weeks ago
- β‘οΈFramework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-levelβ¦β13Updated 4 months ago
- Speaker diarization serviceβ21Updated last week
- A python package for whisper normalizerβ55Updated last week
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 2 years ago
- Dippy Synthetic Speech Subnetβ16Updated 3 weeks ago
- πΌ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decompositionβ16Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β45Updated 8 months ago
- Audio search using Azure Cognitive Searchβ22Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model.β8Updated 2 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsβ66Updated 2 years ago
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β35Updated 2 years ago
- Indic-Conformer models for ASRβ17Updated 9 months ago
- β156Updated last year
- β84Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionβ91Updated last year
- A lightweight Python library for running TTS models with a unified API.β17Updated 2 months ago
- A streaming whisper server for on-prem transcriptionβ20Updated 8 months ago
- Place where folks can contribute to π€ community eventsβ9Updated 2 years ago
- Text To Speech Multilingual Support (+20 Language)β43Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.β14Updated last month
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β28Updated 2 years ago