FastAPI service on top of WhisperX
☆180Jun 17, 2026Updated this week
Alternatives and similar repositories for whisperX-FastAPI
Users that are interested in whisperX-FastAPI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆18Aug 24, 2023Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆451Jun 7, 2026Updated last week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- WhisperX FastAPI integration☆18Mar 31, 2024Updated 2 years ago
- ☆3,393Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆22Apr 26, 2025Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆26Dec 10, 2024Updated last year
- 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, speaker diarization. Free forever: uvx murmurai☆41Dec 17, 2025Updated 6 months ago
- turnkey self-hosted offline transcription and diarization service with llm summary☆937Jan 18, 2026Updated 5 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆22,462Jun 3, 2026Updated 2 weeks ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 10 months ago
- Transcribe with ease :D☆16Jun 21, 2023Updated 2 years ago
- This is the code for the project called IoT attendance system made usign ESP32 and Fingerprint sensor☆11Dec 11, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenAI Whisper ASR Webservice API☆3,282Nov 23, 2025Updated 6 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆64Jan 20, 2025Updated last year
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 10 months ago
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- A nearly-live implementation of OpenAI's Whisper.☆4,092Updated this week
- 📞 AGI interface with python for speech recognition☆30Feb 19, 2024Updated 2 years ago
- An AI chat bot based on volcengine's webRTC protocol.☆35May 6, 2026Updated last month
- ☆14Aug 1, 2025Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,640Nov 12, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A basic HTTP API for handling Faster Whisper audio transcriptions over the network☆34Jul 29, 2025Updated 10 months ago
- 基于micropython的xiaozhi☆40Apr 19, 2025Updated last year
- Генератор российских автомобильных номеров☆16Dec 24, 2020Updated 5 years ago
- A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for ef…☆28Feb 28, 2026Updated 3 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,563Feb 23, 2026Updated 3 months ago
- 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.☆11Sep 13, 2024Updated last year
- ✒️ LanguageTool integration for Quill.js editors☆17Aug 20, 2024Updated last year
- Using image caption models to extract prompts in ComfyUI☆12May 21, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Dockerized Screaming Frog SEO Spider☆13May 22, 2023Updated 3 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆957Jun 3, 2025Updated last year
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆48Oct 13, 2024Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆45Nov 29, 2023Updated 2 years ago
- Transcription and annotation interface for recorded audio or video files☆57Updated this week
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…☆12Dec 26, 2023Updated 2 years ago
- ☆11Jan 6, 2024Updated 2 years ago