FastAPI service on top of WhisperX
☆174Mar 24, 2026Updated this week
Alternatives and similar repositories for whisperX-FastAPI
Users that are interested in whisperX-FastAPI are comparing it to the libraries listed below.
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆423Mar 22, 2026Updated last week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- WhisperX FastAPI integration☆18Mar 31, 2024Updated last year
- ☆3,081Mar 22, 2026Updated last week
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- A real-time Chinese voice conversation application powered by large language models, built on Fastrtc, Ollama, FunASR, and MegaTTS☆21Apr 26, 2025Updated 11 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Mar 10, 2023Updated 3 years ago
- 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, speaker diarization. Free forever: uvx murmurai☆39Dec 17, 2025Updated 3 months ago
- Brace is an LLM-powered course assistant to help with teaching feedback-intensive courses with large student populations.☆12Jan 22, 2026Updated 2 months ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 7 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,952Updated this week
- A cross platform image manipulation desktop application☆19Sep 7, 2025Updated 6 months ago
- This is the code for the project called IoT attendance system, made using an ESP32 and a fingerprint sensor☆11Dec 11, 2019Updated 6 years ago
- OpenAI Whisper ASR Webservice API☆3,206Nov 23, 2025Updated 4 months ago
- A ComfyUI extension for generating captions of images.☆29May 12, 2025Updated 10 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆65Jan 20, 2025Updated last year
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 7 months ago
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features☆14Updated this week
- An AI chat bot based on volcengine's webRTC protocol.☆34Apr 27, 2025Updated 11 months ago
- A nearly-live implementation of OpenAI's Whisper.☆3,914Mar 17, 2026Updated last week
- Subscribe to Mosquitto MQTT Broker and Publish data to MySQL Database☆13Jan 28, 2020Updated 6 years ago
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆29Aug 3, 2025Updated 7 months ago
- ☆19Oct 4, 2019Updated 6 years ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,458Feb 23, 2026Updated last month
- ✒️ LanguageTool integration for Quill.js editors☆17Aug 20, 2024Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,572Nov 12, 2025Updated 4 months ago
- Record audio or transcribe files using ctranslate2 and whisper!☆184Updated this week
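- Video stream transmission implemented with Java, Go, WebSocket, and ESP32☆23Oct 6, 2022Updated 3 years ago
- Service for Bert model to Vector. An efficient text-to-vector service that supports multiple GPUs, multiple workers, and calls from multiple clients, ready to use out of the box.☆13May 24, 2022Updated 3 years ago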
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆47Oct 13, 2024Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆45Nov 29, 2023Updated 2 years ago
- Extract dominant or complementary color palettes from images. Convert colors to English names suitable for txt2img prompts.☆16Jan 5, 2025Updated last year
- Extract individual frames from a video as png images (android)☆13Dec 30, 2022Updated 3 years ago
- ☆11Jan 6, 2024Updated 2 years ago
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…☆13Dec 26, 2023Updated 2 years ago
- Notes on deploying GLM-4-Voice on Ubuntu☆18Oct 31, 2024Updated last year
- A custom logger for your NestJS application using Winston, logging class name, function name, time execution ... in a clean way☆21Oct 24, 2023Updated 2 years ago
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews☆48Aug 6, 2024Updated last year