Blair-Johnson/batch-whisper

Batch Support for OpenAI Whisper

☆97

Alternatives and similar repositories for batch-whisper

Users that are interested in batch-whisper are comparing it to the libraries listed below.

MrEdwards007 / WhisperTaskAcceleration
Accelerate Whisper tasks such as transcription by parallelizing them with multiprocessing
☆25 · Oct 29, 2022 · Updated 3 years ago
Wildhoney / ReactCrossfilter
Crossfilter.js implemented as a mixin for ultra-fast filtering and sorting techniques baked into React.js components.
☆13 · Mar 3, 2015 · Updated 11 years ago
CookiePPP / podcast_rss_feeds
List of podcast feeds using the iTunes API and a script to download ~6,000,000 hours of English speech.
☆31 · Apr 13, 2023 · Updated 3 years ago
StarDawn-VirtualSinger / fast-phasr-next
☆10 · Nov 12, 2024 · Updated last year
wonjune-kang / llm-speech-summarization
Prompting Large Language Models with Audio for General-Purpose Speech Summarization
☆20 · May 14, 2025 · Updated last year
AI4Bharat / Rasa
Expressive TTS Dataset for Assamese, Bengali, and Tamil.
☆15 · Mar 6, 2025 · Updated last year
sidhantls / lexpod-speaker-prediction
Speaker prediction for captions on the Lex Fridman podcast
☆26 · Feb 14, 2024 · Updated 2 years ago
zhuzilin / whisper-openvino
openvino version of openai/whisper
☆184 · Nov 6, 2023 · Updated 2 years ago
HKAB / whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38 · Oct 6, 2023 · Updated 2 years ago
Majdoddin / lexicaps
Transcription and Diarization based on OpenAI's Whisper
☆25 · Sep 9, 2025 · Updated 10 months ago
rhss10 / joint-apa-mdd-mtl
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25 · Nov 9, 2023 · Updated 2 years ago
mrturck / mobile-vr-multiplayer
☆12 · Dec 10, 2022 · Updated 3 years ago
alexey-lysiuk / tools
Various tools written or modified by me
☆10 · Apr 14, 2026 · Updated 3 months ago
ryuclc / CosyVoice2-GRPO
A simple implementation for improving CosyVoice2 by GRPO method
☆39 · May 5, 2026 · Updated 2 months ago
aschubauer / SiderealAstroPy
Astrology calculations with dual-zodiac (tropical and Hindu-Lahiri sidereal) options
☆15 · Dec 4, 2020 · Updated 5 years ago
AVGP / a-island
Procedural island for A-Frame
☆16 · May 5, 2017 · Updated 9 years ago
Human-Centric-Machine-Learning / counterfactual-llms
Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.
☆33 · Nov 7, 2024 · Updated last year
songweige / Dmoz-Dataset
content.rdf.u8.gz
☆11 · Dec 15, 2020 · Updated 5 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
neosun100 / supertonic-tts-enhanced
Enhanced Supertonic TTS with Docker, FastAPI, Web UI, and comprehensive API documentation
☆21 · Dec 7, 2025 · Updated 7 months ago
apayani / ILP
☆10 · Nov 27, 2019 · Updated 6 years ago
pengzhendong / streaming-ChatTTS
☆23 · Oct 30, 2024 · Updated last year
TeaPoly / PLCPA-ASYM-Loss
The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss
☆15 · Sep 4, 2023 · Updated 2 years ago
thuhcsi / Contextual-Biasing-Dataset
Open-source Mandarin biased-word dataset
☆14 · Sep 21, 2023 · Updated 2 years ago
Mddct / simple-tts
(WIP) Long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 · Mar 25, 2024 · Updated 2 years ago
fleek / VADtransciber
☆38 · Dec 26, 2022 · Updated 3 years ago
Pbatch / Codenames
Codenames AI
☆12 · Jun 21, 2022 · Updated 4 years ago
Hypotheses-Paradise / UADF
☆17 · May 5, 2024 · Updated 2 years ago
worldveil / musical_mel_transform_torch
Musical mel transform for semi/quarter-tone features, written in ONNX-compatible PyTorch for audio AI neural networks
☆20 · Feb 20, 2026 · Updated 5 months ago
jackfrost1411 / Residual_Teacher_Student
State-of-the-art architecture for Plant Disease Detection using Deep Learning.
☆10 · Jul 4, 2022 · Updated 4 years ago
liuguoyou / Face-Swapping-GAN-Pytorch
A PyTorch implementation of the ICCV 2019 paper Face Swapping GAN (https://arxiv.org/abs/1908.05932)
☆21 · Nov 11, 2019 · Updated 6 years ago
jaeyeonkim99 / EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
☆96 · Jun 2, 2024 · Updated 2 years ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
pengzhendong / compute-wer
Compute WER and SER for speech recognition evaluation
☆27 · Jun 6, 2026 · Updated last month
lifeiteng / NotebookTTS
Text-To-Speech for NotebookLM
☆39 · Jul 20, 2025 · Updated last year
cnbeining / Whisper_Notebook
A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.
☆33 · Feb 4, 2024 · Updated 2 years ago
ishine / LangSegment
A multilingual (97 languages) tool for TTS that automatically recognizes and segments mixed-language text content.
☆23 · Feb 20, 2024 · Updated 2 years ago
buildbotics / bbctrl-posts
CAM Post Processors for the Buildbotics CNC Controller
☆13 · Nov 8, 2024 · Updated last year