ancs21/awesome-openai-whisper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ancs21/awesome-openai-whisper)

ancs21 / awesome-openai-whisper

A curated list of awesome OpenAI's Whisper

☆107

Alternatives and similar repositories for awesome-openai-whisper

Users that are interested in awesome-openai-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
OpenVoiceOS / ovos-plugin-manager
View on GitHub
plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere
☆14Updated this week
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
miguelvalente / whisperer
View on GitHub
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆137Aug 14, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
bayartsogt-ya / whisper-multiple-hf-datasets
View on GitHub
Whisper fine-tuning event script to use multiple hf datasets
☆32Dec 20, 2022Updated 3 years ago
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
vtuber-plan / vcvits
View on GitHub
Non Parallel Voice Conversion based on VITS
☆24Mar 31, 2023Updated 3 years ago
OpenVoiceOS / ovos-stt-plugin-vosk
View on GitHub
vosk STT plugin for mycroft
☆15Jun 15, 2026Updated last month
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
Archivoice / nnsvs-chinese-support
View on GitHub
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13Oct 14, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
homink / speech.ko
View on GitHub
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43Feb 28, 2018Updated 8 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
ishine / PnG-BERT
View on GitHub
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
☆24Jan 29, 2022Updated 4 years ago
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
koth / EmotiVoice.cpp
View on GitHub
cpp inference for EmotiVoice
☆16Jan 1, 2024Updated 2 years ago
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
HuangZiliAndy / RPNSD
View on GitHub
PyTorch implementation of RPNSD
☆60Jun 17, 2024Updated 2 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
View on GitHub
C++ version of pyannote audio overlapped speech detection pipeline
☆13Feb 14, 2024Updated 2 years ago
OHF-Voice / pymicro-wakeword
View on GitHub
Tensorflow-based wake word detection
☆20Jun 22, 2026Updated last month
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
avryhof / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Mar 9, 2022Updated 4 years ago
ProjectEGU / whisper-for-low-vram
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆29Dec 16, 2023Updated 2 years ago
Stage-Whisper / Stage-Whisper
View on GitHub
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automati…
☆261Apr 27, 2023Updated 3 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago