KevKibe / African-WhisperLinks
π Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
β31Updated 7 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
Sorting:
- β127Updated 6 months ago
- β158Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's toolsβ164Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extractionβ101Updated 3 months ago
- HF's ML for Audio study groupβ197Updated 2 years ago
- β170Updated 9 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languagesβ113Updated 11 months ago
- Open TTS models, built for streaming on the edgeβ43Updated 6 months ago
- β49Updated 2 years ago
- β‘οΈFramework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-levelβ¦β13Updated 9 months ago
- Collection of Open Source Speech Dataβ161Updated last week
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β54Updated 2 years ago
- β378Updated last year
- β310Updated last year
- A streaming whisper server for on-prem transcriptionβ22Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)β37Updated 2 months ago
- πΌ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decompositionβ15Updated last year
- Create an LJSpeech structured voice dataset on wave inputβ35Updated last year
- Speaker diarization serviceβ24Updated 3 months ago
- β131Updated last week
- Speaker Diarization with Transformersβ69Updated 3 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β98Updated last month
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β341Updated 2 years ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationβ185Updated 5 months ago
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsβ66Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ97Updated last year
- Joint speech-language model - respond directly to audio!β372Updated last year