KevKibe / African-Whisper
π Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
β21Updated last month
Alternatives and similar repositories for African-Whisper:
Users that are interested in African-Whisper are comparing it to the libraries listed below
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPUβ32Updated last year
- Dippy Synthetic Speech Subnetβ15Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β57Updated last week
- β121Updated 3 weeks ago
- β154Updated last year
- β‘οΈFramework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-levelβ¦β13Updated last month
- β62Updated 6 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β49Updated last year
- Speaker diarization serviceβ21Updated this week
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.β15Updated 3 months ago
- Audio tokenization, in the fastest way possible!β48Updated 5 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β27Updated last year
- A lightweight Python library for running TTS models with a unified API.β16Updated this week
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- create dataset from list of youtube links easilyβ17Updated last year
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β34Updated last year
- A TTS model that makes a speaker speak new languagesβ76Updated 8 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languagesβ102Updated 4 months ago
- a simple system for 2-way interruptible voice interactions between human and LLMβ22Updated last year
- Use quantized versions of Whisper to speed up inferenceβ12Updated 4 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ141Updated last year
- A python package for whisper normalizerβ47Updated 2 months ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projectsβ22Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 2 years ago
- β117Updated 3 months ago
- POS for African languagesβ17Updated last year
- Create an LJSpeech structured voice dataset on wave inputβ26Updated 4 months ago