KevKibe / African-WhisperLinks
π Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
β27Updated 3 months ago
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below
Sorting:
- Place where folks can contribute to π€ community eventsβ9Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β62Updated last week
- β‘οΈFramework for fast persistent storage of multiple document embeddings and metadata into Pinecone for source-traceable, production-levelβ¦β13Updated 5 months ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Aiβ¦β10Updated 7 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β54Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 2 years ago
- β157Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASRβ56Updated last month
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β45Updated 10 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languagesβ106Updated 8 months ago
- Open TTS models, built for streaming on the edgeβ43Updated 2 months ago
- Audio tokenization, in the fastest way possible!β52Updated 9 months ago
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago
- β46Updated 2 years ago
- Speaker Diarization with Transformersβ64Updated last year
- Speaker diarization serviceβ23Updated last month
- Efficient approach to speaker diarization using voice characteristics extractionβ94Updated last year
- Streamlit app for scheduling habits and interacting with your schedule using ChatGPT and LangChainβ19Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPUβ32Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model.β9Updated last week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 3 weeks ago
- A streaming whisper server for on-prem transcriptionβ20Updated 9 months ago
- A python package for whisper normalizerβ60Updated 3 weeks ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projectsβ23Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2β32Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- β127Updated 2 months ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- A collection of NLP notebooks for quick demos and hands-on learning.β19Updated last week