FastAPI service on top of WhisperX
☆179May 2, 2026Updated last week
Alternatives and similar repositories for whisperX-FastAPI
Users that are interested in whisperX-FastAPI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆18Aug 24, 2023Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆429Apr 5, 2026Updated last month
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- WhisperX FastAPI integration☆18Mar 31, 2024Updated 2 years ago
- ☆3,243Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆91Feb 2, 2025Updated last year
- turnkey self-hosted offline transcription and diarization service with llm summary☆929Jan 18, 2026Updated 3 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆22Nov 4, 2024Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,760Apr 4, 2026Updated last month
- A webscraper for www.willhaben.at☆13Mar 21, 2025Updated last year
- OpenAI Whisper ASR Webservice API☆3,251Nov 23, 2025Updated 5 months ago
- A ComfyUI extension for generating captions of images.☆29May 12, 2025Updated 11 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆64Jan 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 8 months ago
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- A nearly-live implementation of OpenAI's Whisper.☆4,001Apr 21, 2026Updated 2 weeks ago
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆15Apr 19, 2026Updated 2 weeks ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆23Jun 5, 2025Updated 11 months ago
- An AI chat bot based on volcengine's webRTC protocol.☆35Apr 27, 2025Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,611Nov 12, 2025Updated 5 months ago
- A basic HTTP API for handling Faster Whisper audio transcriptions over the network☆33Jul 29, 2025Updated 9 months ago
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆32Apr 5, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for ef…☆28Feb 28, 2026Updated 2 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,511Feb 23, 2026Updated 2 months ago
- Personal assistant, project and schedule manager, coach, motivator, angry girlfriend and salvation - character AI waifu llm based on olla…☆12Jan 26, 2026Updated 3 months ago
- Using image caption models to extract prompts in ComfyUI☆11May 21, 2025Updated 11 months ago
- A simple implementation of real-time output device audio transcription and translation using "faster_whisper" and "pyaudiowpatch".☆22May 6, 2023Updated 3 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆17Oct 12, 2024Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆945Jun 3, 2025Updated 11 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆45Nov 29, 2023Updated 2 years ago
- Transcription and annotation interface for recorded audio or video files☆52Apr 30, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Extract dominant or complementary color palettes from images. Convert colors to English names suitable for txt2img prompts.☆16Jan 5, 2025Updated last year
- Extract individual frames from a video as png images (android)☆13Dec 30, 2022Updated 3 years ago
- ☆28Apr 14, 2026Updated 3 weeks ago
- short youtube video summaries☆20Jun 29, 2025Updated 10 months ago
- Inference app for a FP8-quantized flux1-dev model. This runs on graphic cards with 16 GB of VRAM.☆41Mar 5, 2025Updated last year
- llama.cpp fork with additional SOTA quants and improved performance☆22Updated this week
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…☆12Dec 26, 2023Updated 2 years ago