parambharat/whisper-finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/parambharat/whisper-finetuning)

parambharat / whisper-finetuning

Repository contains code to fine-tune WhisperASR model

☆23

Alternatives and similar repositories for whisper-finetuning

Users that are interested in whisper-finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
mesolitica / multimodal-LLM
View on GitHub
Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.
☆18Feb 20, 2024Updated 2 years ago
du-ud / kaldi-cslt
View on GitHub
☆15Aug 30, 2022Updated 3 years ago
navalnica / whisper-finetuning-be
View on GitHub
Finetuning Whisper ASR model for Belarusian language
☆17Feb 16, 2025Updated last year
Vaibhavs10 / fast-whisper-finetuning
View on GitHub
☆562Jul 10, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tomfletcher / GeometryOfData
View on GitHub
☆10Apr 23, 2026Updated 3 months ago
warner-benjamin / NLPbookstudygroup
View on GitHub
NLP with Transformers Study Group Materials & Resources
☆11Jun 26, 2023Updated 3 years ago
siddh30 / The-Airbnb-Classification-Project
View on GitHub
This project is from the Airbnb Recruitment Challenge on Kaggle. The challenge is to solve a multi-class classification problem of predic…
☆11Feb 22, 2022Updated 4 years ago
rishabhjain16 / whisper_child_asr
View on GitHub
☆12May 23, 2023Updated 3 years ago
huggingface / community-events
View on GitHub
Place where folks can contribute to 🤗 community events
☆427Dec 7, 2023Updated 2 years ago
Mildemelwe / Non-English-Tacotron-2-Training-Notebook
View on GitHub
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Nov 19, 2022Updated 3 years ago
ZhihaoDU / du2022sond
View on GitHub
Speaker overlap-aware Neural Diarization
☆12Feb 13, 2023Updated 3 years ago
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
dreji18 / Fine-tune-Speech-Recognition
View on GitHub
Tutorial on how to train a custom voice recognition model using Hugging face models.
☆11Jul 2, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
CODEJIN / SPEECHSPLIT
View on GitHub
An implement of SPEECHSPLIT
☆15Sep 12, 2020Updated 5 years ago
mehedihasanbijoy / DPCSpell
View on GitHub
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆14Aug 9, 2024Updated last year
HLasse / multidiagnosis-speech
View on GitHub
☆10Jun 23, 2023Updated 3 years ago
BriansIDP / AudioVisualLLM
View on GitHub
☆19May 19, 2024Updated 2 years ago
adamwdraper / tyler
View on GitHub
A development kit for manifesting AI agents with a complete lack of conventional limitations
☆17Jul 8, 2025Updated last year
deepberlin1 / aiforgood2020
View on GitHub
General information about DEEP BERLIN's AI for Good Hackathon 2020
☆11Apr 14, 2020Updated 6 years ago
philschmid / huggingface-container
View on GitHub
☆10Dec 15, 2022Updated 3 years ago
NCMlab / CognitiveTasks
View on GitHub
☆11Apr 4, 2023Updated 3 years ago
01-vyom / End_2_End_Automatic_Speech_Recognition_For_Gujarati
View on GitHub
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"
☆13Jul 26, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
xue926 / Shared-bicycle-usage-forecast
View on GitHub
本项目使用python对影响共享单车使用量的因素进行可视化分析，并使用lightGBM算法对已知条件下的共享单车使用量进行预测。其中为了选择最优模型，使用了k折交叉验证和网格搜索选择最优参数。
☆10Jul 15, 2020Updated 6 years ago
TIGER-AI-Lab / LLM-AMT
View on GitHub
This repository contains the code for our paper "Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering" [EMNLP…
☆14Oct 8, 2024Updated last year
QIAIUNCC / EYE-Llama
View on GitHub
☆12May 12, 2025Updated last year
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
wayne0926 / countdown
View on GitHub
很久以前写的人生倒计时工具，由于博客内无法运行，拿出来
☆11Jun 9, 2022Updated 4 years ago
philschmid / deep-learning-remote-runner
View on GitHub
☆16Aug 10, 2022Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
frankkramer-lab / GPTNERMED
View on GitHub
GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.
☆15Oct 5, 2023Updated 2 years ago
vmanita / Customer-purchase-prediction
View on GitHub
Classification machine learning models to predict the probability of a client accepting a future marketing campaign/product release.
☆17Jul 27, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liu12366262626 / AlignVSR
View on GitHub
Visual Speech Recongnition
☆21Dec 24, 2024Updated last year
jumon / whisper-finetuning
View on GitHub
[WIP] Scripts for fine-tuning Whisper
☆221Jul 2, 2026Updated 3 weeks ago
lcn-kul / xls-r-analysis-sqa
View on GitHub
Analysis of XLS-R for Speech Quality Assessment
☆15Feb 10, 2025Updated last year
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
AbhinavUtkarsh / Image-Segmentation
View on GitHub
Image segmentation by KNN Algorithm project Report for subject Digital Image Processing (CS1553). This Project has an analysis of K - Nea…
☆11Aug 20, 2023Updated 2 years ago
MagicHub-io / CSASR_Challenge
View on GitHub
☆11Sep 26, 2022Updated 3 years ago
RazhanHameed / kurdish-llama
View on GitHub
This is an attempt to fine-tune the Llama model for Central Kurdish.
☆17May 24, 2023Updated 3 years ago