Batch Support for OpenAI Whisper
☆97Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for batch-whisper
Users that are interested in batch-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Apr 13, 2023Updated 2 years ago
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- The wizard of oz code used for collecting goal-oriented dialogue systems☆13Oct 30, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆38Jul 4, 2024Updated last year
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆20May 14, 2025Updated 10 months ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- openvino version of openai/whisper☆183Nov 6, 2023Updated 2 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Nov 7, 2024Updated last year
- Speaker prediction for captions on the Lex Fridman podcast☆27Feb 14, 2024Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Oct 6, 2023Updated 2 years ago
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 7 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Various tools written or modified by me☆10Oct 26, 2025Updated 5 months ago
- ☆10Oct 17, 2021Updated 4 years ago
- content.rdf.u8.gz☆11Dec 15, 2020Updated 5 years ago
- ☆13Sep 25, 2024Updated last year
- ☆23Oct 30, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated last year
- Simple Python library for doing (multiple) sequence alignment☆16Jun 24, 2018Updated 7 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- faster inference☆28Jan 20, 2025Updated last year
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- ☆38Dec 26, 2022Updated 3 years ago
- just for fun☆14Mar 11, 2018Updated 8 years ago
- State-of-the-art architecture for Plant Disease Detection using Deep Learning.☆10Jul 4, 2022Updated 3 years ago
- ☆24Dec 11, 2024Updated last year
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- ☆17May 5, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Pytorch implemtentation of ICCV 2019 paper Face Swapping Gan (https://arxiv.org/abs/1908.05932)☆21Nov 11, 2019Updated 6 years ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆33Feb 4, 2024Updated 2 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆70Dec 23, 2025Updated 3 months ago
- ☆11Sep 26, 2022Updated 3 years ago
- ☆12Nov 7, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool.它是一个TTS多语言(97种语言)的混合文本内容自动识别和拆分工具。☆21Feb 20, 2024Updated 2 years ago