Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
★38 · Feb 27, 2025 · Updated last year
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below.
- Helpful Machine Learning resources for the Zindi community ★15 · Aug 26, 2023 · Updated 2 years ago
- ★17 · Apr 22, 2026 · Updated last week
- MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech… ★29 · Jul 24, 2025 · Updated 9 months ago
- Analysis of XLS-R for Speech Quality Assessment ★15 · Feb 10, 2025 · Updated last year
- Spam classification using an HMM model in Python ★19 · Nov 28, 2022 · Updated 3 years ago
- This repository contains code for fine-tuning the Whisper speech-to-text model. ★23 · Apr 21, 2026 · Updated last week
- My junior Python projects ★16 · Jun 23, 2025 · Updated 10 months ago
- It's Corn (PogChamps #3) Kaggle Competition 1st Place Winning Solution ★10 · Oct 4, 2022 · Updated 3 years ago
- ★11 · Mar 7, 2025 · Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation ★12 · Jan 27, 2023 · Updated 3 years ago
- A rust library to scrape an instagram user's photos and videos ★18 · Oct 3, 2024 · Updated last year
- Repository contains code to fine-tune WhisperASR model ★23 · Dec 16, 2022 · Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ★16 · Dec 3, 2024 · Updated last year
- Create 3d CAD models with React using jscad ★49 · Aug 20, 2025 · Updated 8 months ago
- ★12 · Apr 18, 2025 · Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ★11 · Nov 6, 2024 · Updated last year
- ★46 · Mar 13, 2024 · Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ★13 · Oct 12, 2022 · Updated 3 years ago
- Neural model for prediction of stress position in Russian words ★13 · Jun 22, 2025 · Updated 10 months ago
- A Rust library for the Hyperliquid API ★25 · Sep 19, 2024 · Updated last year
- ★25 · Oct 21, 2025 · Updated 6 months ago
- JSGF Deducer based on JSGF grammar and WFST ★11 · Jan 11, 2018 · Updated 8 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms. ★31 · Feb 16, 2024 · Updated 2 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge ★12 · Nov 8, 2018 · Updated 7 years ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes ★11 · Oct 19, 2023 · Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ★15 · Jun 23, 2024 · Updated last year
- Multi-lingual AudioCaps ★13 · Nov 20, 2023 · Updated 2 years ago
- ★12 · Oct 9, 2018 · Updated 7 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ★19 · Jul 16, 2024 · Updated last year
- AI based singing voice synthesis database generator ★13 · Aug 12, 2022 · Updated 3 years ago
- A project to take an audio file and separate it into speakers and play it with avatars and save the recording as an mp4 for sharing on so… ★13 · Nov 6, 2024 · Updated last year
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021) ★10 · Oct 11, 2021 · Updated 4 years ago
- Chinese-ASR built on kaldi ★14 · Jan 21, 2019 · Updated 7 years ago
- 3rd Place solution ★12 · Nov 20, 2024 · Updated last year
- Bilibili video audio-track downloader (supports multi-part videos). Rebuild from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 with python ★11 · Jul 31, 2025 · Updated 9 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations" ★15 · Dec 22, 2022 · Updated 3 years ago
- ★18 · Apr 26, 2025 · Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible ★15 · Dec 19, 2023 · Updated 2 years ago
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis. ★36 · Apr 19, 2026 · Updated last week