Batch Support for OpenAI Whisper
☆97Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for batch-whisper
Users that are interested in batch-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Apr 13, 2023Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- The wizard of oz code used for collecting goal-oriented dialogue systems☆13Oct 30, 2017Updated 8 years ago
- ☆38Jul 4, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆20May 14, 2025Updated 11 months ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- openvino version of openai/whisper☆183Nov 6, 2023Updated 2 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆33Nov 7, 2024Updated last year
- Speaker prediction for captions on the Lex Fridman podcast☆27Feb 14, 2024Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Oct 6, 2023Updated 2 years ago
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 7 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- content.rdf.u8.gz☆11Dec 15, 2020Updated 5 years ago
- ☆13Sep 25, 2024Updated last year
- ☆23Oct 30, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated 2 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆38Dec 26, 2022Updated 3 years ago
- ☆24Dec 11, 2024Updated last year
- State-of-the-art architecture for Plant Disease Detection using Deep Learning.☆10Jul 4, 2022Updated 3 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 9 months ago
- ☆17May 5, 2024Updated last year
- A Pytorch implemtentation of ICCV 2019 paper Face Swapping Gan (https://arxiv.org/abs/1908.05932)☆21Nov 11, 2019Updated 6 years ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆33Feb 4, 2024Updated 2 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 4 months ago
- ☆12Nov 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Sep 26, 2022Updated 3 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆95Jun 2, 2024Updated last year
- Speaker Role Contextual Model for Dialogues☆15Sep 30, 2017Updated 8 years ago
- ☆158Nov 22, 2024Updated last year
- generate granular word-level captions in srt format☆57Sep 26, 2022Updated 3 years ago
- Music Line Bot powered by OLAMI and KKBOX Open API.☆11Dec 8, 2022Updated 3 years ago
- Faster distil-whisper transcription with CTranslate2☆14Jan 23, 2024Updated 2 years ago