collabora/whisper-finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/collabora/whisper-finetuning)

collabora / whisper-finetuning

Whisper finetuning

☆17

Alternatives and similar repositories for whisper-finetuning

Users that are interested in whisper-finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RuABraun / texterrors
View on GitHub
☆37Jun 9, 2026Updated last month
cadia-lvl / samromur-asr
View on GitHub
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12Sep 30, 2022Updated 3 years ago
AI4Bharat / IndicMFA
View on GitHub
☆18Sep 13, 2024Updated last year
gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
View on GitHub
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11Sep 7, 2021Updated 4 years ago
pzelasko / kaldialign
View on GitHub
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70Jun 15, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HarikalarKutusu / 3d-voice-chess
View on GitHub
A voice driven 3D chess game for learning Voice AI
☆17Jul 6, 2022Updated 4 years ago
eliemaalouly / Transcription-whisper_pyannote
View on GitHub
☆17Nov 8, 2022Updated 3 years ago
xcmyz / CLONE
View on GitHub
☆20Jul 13, 2022Updated 4 years ago
vadimkantorov / convasr
View on GitHub
Baseline convolutional ASR system in PyTorch
☆21Nov 16, 2023Updated 2 years ago
hediet / rust-ffmpeg-frame-grabber
View on GitHub
Provides a frame iterator for videos by using ffmpeg. Decodes images using the image crate.
☆12Mar 31, 2021Updated 5 years ago
loglux / FlexAudioPrint
View on GitHub
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …
☆10Apr 22, 2026Updated 3 months ago
PhilippeRo / IBus-Speech-To-Text
View on GitHub
A speech to text IBus engine using VOSK
☆40Nov 6, 2022Updated 3 years ago
harujoh / TensorFlowLiteNet
View on GitHub
TensorFlowLiteNet allows to use TensorFlowLite from C#.
☆11Apr 14, 2021Updated 5 years ago
lucasnewman / e2-tts-mlx
View on GitHub
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆21Oct 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sreecodeslayer / ml-am-lm-cmusphinx
View on GitHub
This is Malayalam Speech Recognition model developed for CMUSphinx. This is now used for Google Summer Code 2016
☆29Aug 22, 2016Updated 9 years ago
kurianbenoy / whisper_normalizer
View on GitHub
A python package for whisper normalizer
☆79Jul 17, 2026Updated last week
andi611 / Conditional-SpecGAN-Tensorflow
View on GitHub
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Dec 12, 2018Updated 7 years ago
uyghur-language / uyghur-language.github.io
View on GitHub
☆13Jul 12, 2026Updated last week
UyCode / uyfonts
View on GitHub
there are UKIJ and Uighursoft fonts
☆13Oct 21, 2022Updated 3 years ago
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
k2-fsa / text_search
View on GitHub
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆79Jun 30, 2025Updated last year
imraf / ai-model-reference
View on GitHub
A reference of hardware requirements for Gen AI models
☆15Apr 30, 2025Updated last year
popcornell / FastMSS
View on GitHub
☆32May 18, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
UlugbekSalaev / UzTransliterator
View on GitHub
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
☆13Jan 6, 2026Updated 6 months ago
emoncms / dashboard
View on GitHub
dashboard module for emoncms
☆14Jun 9, 2026Updated last month
148nasuka / Vocal2lab
View on GitHub
NNSVS向けの教師データのラベル作成支援ツールです。
☆10Apr 5, 2023Updated 3 years ago
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
gheyret / UyghurNgram
View on GitHub
Make N-Gram for Uyghur language
☆15Dec 24, 2020Updated 5 years ago
x893 / ProScan
View on GitHub
OBD Scan Tool .NET 2.0
☆13Oct 5, 2015Updated 10 years ago
ar1st0crat / MusCat
View on GitHub
Music Catalogizer + MP3 ID tag parser + Radio (WPF, WebApi, Angular)
☆14Oct 20, 2021Updated 4 years ago
secile / MotionJPEGWriter
View on GitHub
C# source code for creating MotionJPEG.
☆16Jan 9, 2020Updated 6 years ago
naver-ai / RapFlow-TTS
View on GitHub
☆56Jul 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Roboy / sonosco
View on GitHub
Framework for Deep Speech Recognition
☆11Nov 22, 2022Updated 3 years ago
azmat21 / Syllabification-for-Uyghur
View on GitHub
☆11Nov 13, 2015Updated 10 years ago
yweweler / single-speaker-tts
View on GitHub
This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron a…
☆12Dec 28, 2018Updated 7 years ago
bjnortier / whisper-tflite-ios
View on GitHub
☆19Nov 4, 2022Updated 3 years ago
gheyret / uyghur-asr-transformer
View on GitHub
Speech Recognition for Uyghur using Speech transformer
☆28Jun 19, 2021Updated 5 years ago
aksiksi / needle
View on GitHub
A CLI tool that finds a needle (opening/intro and ending/credits) in a haystack (TV or anime episode).
☆19Sep 8, 2024Updated last year
lingjzhu / probing-TTS-models
View on GitHub
Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf
☆32Jul 6, 2023Updated 3 years ago