Batch Support for OpenAI Whisper
☆97Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for batch-whisper
Users that are interested in batch-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Apr 13, 2023Updated 3 years ago
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 2 months ago
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆20May 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- openvino version of openai/whisper☆183Nov 6, 2023Updated 2 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆33Nov 7, 2024Updated last year
- Speaker prediction for captions on the Lex Fridman podcast☆27Feb 14, 2024Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Oct 6, 2023Updated 2 years ago
- ☆12Aug 17, 2024Updated last year
- Various tools written or modified by me☆10Apr 14, 2026Updated last month
- A toy c compiler written in python☆11Jan 9, 2024Updated 2 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- Procedural island for A-Frame☆16May 5, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆14Jan 23, 2024Updated 2 years ago
- Generate MANY nfts, become rich and retire at the age of 5.☆14Mar 8, 2022Updated 4 years ago
- ☆23Oct 30, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated 2 years ago
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- Simple Python library for doing (multiple) sequence alignment☆16Jun 24, 2018Updated 7 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🔧 A Ratchet & Clank prototype made with Unreal Engine 4 in one week. A Game Development Process video is also available 📺☆14Mar 15, 2023Updated 3 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- CloudFront authorization with Cognito for CDK☆19Mar 27, 2026Updated last month
- just for fun☆14Mar 11, 2018Updated 8 years ago
- ☆24Dec 11, 2024Updated last year
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- ☆17May 5, 2024Updated 2 years ago
- A Pytorch implemtentation of ICCV 2019 paper Face Swapping Gan (https://arxiv.org/abs/1908.05932)☆21Nov 11, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆33Feb 4, 2024Updated 2 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 4 months ago
- ☆11Sep 26, 2022Updated 3 years ago
- ☆12Nov 7, 2024Updated last year
- A repo to accopmany my youtube video on how to build an AI receptionist with langgraph☆16Aug 23, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆95Jun 2, 2024Updated last year
- generate granular word-level captions in srt format☆57Sep 26, 2022Updated 3 years ago