jim60105 / docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
☆418Feb 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for docker-whisperX
Users who are interested in docker-whisperX are comparing it to the repositories listed below
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,051Updated this week
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆16Aug 24, 2023Updated 2 years ago
- FastAPI service on top of WhisperX☆170Updated this week
- Docker image for WhisperX by Max Bain☆12Sep 24, 2025Updated 4 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,355Nov 26, 2025Updated 2 months ago
- This repository collects all of my containerization works and serves as a template for quickly starting new projects. (Containerfile temp…☆12Nov 14, 2025Updated 3 months ago
- rmp data ranking☆13Nov 4, 2025Updated 3 months ago
- OpenAI Whisper ASR Webservice API☆3,158Nov 23, 2025Updated 2 months ago
- ☆2,935Updated this week
- Transcribe with ease :D☆16Jun 21, 2023Updated 2 years ago
- Faster Whisper transcription with CTranslate2☆20,833Nov 19, 2025Updated 2 months ago
- This is a playlist tool to play video clips with "start~end time" directly on Youtube/GoogleDrive/TwitCasting. (Chrome extension) https:/…☆19Dec 28, 2025Updated last month
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆250Updated this week
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!☆2,913Aug 15, 2025Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,530Nov 12, 2025Updated 3 months ago
- This project is based on llama.cpp and compiles only the RPC server, as well as auxiliary utilities that operate in RPC-client mode…☆23May 25, 2025Updated 8 months ago
- A python package to build AI-powered real-time audio applications☆1,931Feb 12, 2025Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Feb 1, 2024Updated 2 years ago
- ☆13Nov 24, 2025Updated 2 months ago
- A supplemental tool for generating EFCore Entity Model class's [ModelMetadataType] partial class. (a.k.a. Buddy Class).☆10Apr 24, 2022Updated 3 years ago
- turnkey self-hosted offline transcription and diarization service with llm summary☆918Jan 18, 2026Updated 3 weeks ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- My personal GitHub Copilot prompts (copilot-instructions)☆17Updated this week
- Quickly build your full stack deployable development project.☆21Jan 11, 2026Updated last month
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 7 months ago
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews☆48Aug 6, 2024Updated last year
- Runpod WhisperX Docker Container Repo☆15Mar 10, 2024Updated last year
- YouTube livestream recorder (bash in Docker)☆17Jul 17, 2023Updated 2 years ago
- A test example of using a voice identification model with the "Vosk" (Воск) speech recognition library: https://alphacephei…☆12Aug 14, 2023Updated 2 years ago
- Flight Recorder allows to record client program execution and examine it later☆11Sep 18, 2020Updated 5 years ago
- A nearly-live implementation of OpenAI's Whisper.☆3,803Updated this week
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆2,852Nov 7, 2025Updated 3 months ago
- Real time transcription with OpenAI Whisper.☆2,909Apr 15, 2025Updated 9 months ago
- ☆8,809Oct 25, 2025Updated 3 months ago
- Self-hosted AI audio transcription☆2,027Feb 1, 2026Updated last week
- ☆12Oct 23, 2022Updated 3 years ago
- ☆11Jun 8, 2023Updated 2 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆26Dec 18, 2025Updated last month