Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
★37 · Feb 27, 2025 · Updated last year
Alternatives and similar repositories for African-Whisper
Users that are interested in African-Whisper are comparing it to the libraries listed below.
- The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the tr… ★14 · Dec 4, 2023 · Updated 2 years ago
- Skin cancer classification project using deep learning techniques for automated diagnosis of skin lesions. ★11 · Jun 2, 2024 · Updated last year
- MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech… ★29 · Jul 24, 2025 · Updated 8 months ago
- ★18 · Jan 27, 2026 · Updated 2 months ago
- Analysis of XLS-R for Speech Quality Assessment ★15 · Feb 10, 2025 · Updated last year
- temporary files created by opensubtitles-scraper ★17 · Feb 3, 2026 · Updated 2 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ★14 · Apr 6, 2025 · Updated last year
- Spam classification using an HMM model in Python ★19 · Nov 28, 2022 · Updated 3 years ago
- ★16 · Jan 6, 2025 · Updated last year
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m… ★24 · May 3, 2025 · Updated 11 months ago
- ★15 · May 14, 2025 · Updated 10 months ago
- This repository contains code for fine-tuning the Whisper speech-to-text model. ★23 · Mar 27, 2026 · Updated last week
- My junior Python projects ★16 · Jun 23, 2025 · Updated 9 months ago
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings ★27 · Oct 25, 2025 · Updated 5 months ago
- It's Corn (PogChamps #3) Kaggle Competition 1st Place Winning Solution ★10 · Oct 4, 2022 · Updated 3 years ago
- ★12 · Mar 7, 2025 · Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai… ★26 · Oct 9, 2024 · Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation ★12 · Jan 27, 2023 · Updated 3 years ago
- A Rust library to scrape an Instagram user's photos and videos ★18 · Oct 3, 2024 · Updated last year
- Repository containing code to fine-tune the WhisperASR model ★23 · Dec 16, 2022 · Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ★16 · Dec 3, 2024 · Updated last year
- ★12 · Apr 18, 2025 · Updated 11 months ago
- Skin Cancer Object Detection-YOLOv5-YOLOv8 ★17 · Jun 5, 2024 · Updated last year
- ★22 · Oct 21, 2025 · Updated 5 months ago
- ★14 · Nov 22, 2022 · Updated 3 years ago
- Ghana Crop Disease Detection Challenge - 5th Place Solution ★15 · Dec 17, 2024 · Updated last year
- Google Translate API for Free ★26 · Apr 22, 2021 · Updated 4 years ago
- ★32 · Sep 22, 2024 · Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ★13 · Oct 12, 2022 · Updated 3 years ago
- Neural model for prediction of stress position in Russian words ★13 · Jun 22, 2025 · Updated 9 months ago
- Compile typst documents with a simple HTTP request ★39 · Jan 20, 2026 · Updated 2 months ago
- A Rust library for the Hyperliquid API ★25 · Sep 19, 2024 · Updated last year
- ★18 · Aug 29, 2021 · Updated 4 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms. ★31 · Feb 16, 2024 · Updated 2 years ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes ★11 · Oct 19, 2023 · Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ★19 · Jul 16, 2024 · Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus ★36 · Apr 11, 2023 · Updated 2 years ago
- ★57 · Sep 22, 2022 · Updated 3 years ago
- This repository provides an unofficial, reverse-engineered API for DeepSeek Chat & Coder (v2), allowing free and unlimited access to its … ★28 · Jun 17, 2024 · Updated last year