Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
☆427Apr 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for docker-whisperX
Users that are interested in docker-whisperX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,363Apr 4, 2026Updated 2 weeks ago
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- FastAPI service on top of WhisperX☆177Updated this week
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,485Feb 23, 2026Updated last month
- ☆3,166Apr 9, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Docker image for WhisperX by Max Bain☆12Sep 24, 2025Updated 6 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,734Updated this week
- OpenAI Whisper ASR Webservice API☆3,233Nov 23, 2025Updated 4 months ago
- faster-whisper as serverless endpoint☆137Apr 11, 2026Updated last week
- ☆18Feb 18, 2025Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,794Sep 9, 2025Updated 7 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆1,096Apr 9, 2026Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆252Feb 10, 2026Updated 2 months ago
- turnkey self-hosted offline transcription and diarization service with llm summary☆923Jan 18, 2026Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆48Aug 6, 2024Updated last year
- Self-hosted AI audio transcription☆2,559Mar 22, 2026Updated 3 weeks ago
- 本工具是python tkinter编写的一个简单的Gui,任务批量管理器。通过Gui选项生成*CMD*(command),来调用whisper,达到批量生成,管理的目的。支持whisper和whisperx☆58Aug 29, 2023Updated 2 years ago
- RunPod Serverless worker for WhisperX☆19Mar 26, 2026Updated 3 weeks ago
- A nearly-live implementation of OpenAI's Whisper.☆3,962Mar 17, 2026Updated last month
- ☆12,443Oct 25, 2025Updated 5 months ago
- rmp data ranking☆13Nov 4, 2025Updated 5 months ago
- A supplemental tool for generating EFCore Entity Model class's [ModelMetadataType] partial class. (a.k.a. Buddy Class).☆10Apr 24, 2022Updated 3 years ago
- Quickly build your full stack deployable development project.☆21Jan 11, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A high-performance batch audio transcription tool using nvidia/parakeet-tdt-0.6b-v2 to generate accurate, well-segmented SRT subtitles, w…☆18Dec 9, 2025Updated 4 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆64Jan 20, 2025Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Oct 30, 2024Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆202Nov 2, 2025Updated 5 months ago
- OpenAI Whisper Prompt Examples☆53Jul 17, 2023Updated 2 years ago
- Данный проект основан на llama.cpp и компилирует только RPC-сервер, а так же вспомогательные утилиты, работающие в режиме RPC-клиента, не…☆24May 25, 2025Updated 10 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆2,215Oct 29, 2025Updated 5 months ago
- Тестовый пример задействования модели для идентификации голоса с помощью библиотеки распознавания речи "Vosk" (Воск): https://alphacephei…☆12Aug 14, 2023Updated 2 years ago
- Real time transcription with OpenAI Whisper.☆2,921Apr 15, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- How to use OpenAIs Whisper to transcribe and diarize audio files☆376Oct 12, 2022Updated 3 years ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,068Jan 8, 2025Updated last year
- Batch Support for OpenAI Whisper☆97Jan 19, 2024Updated 2 years ago
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Aug 12, 2024Updated last year
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆2,977Nov 7, 2025Updated 5 months ago
- WhisperX Repository Modified to run on Mac☆18Oct 26, 2023Updated 2 years ago