pavelzbornik/whisperX-FastAPI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pavelzbornik/whisperX-FastAPI)

pavelzbornik / whisperX-FastAPI

FastAPI service on top of WhisperX

☆184

Alternatives and similar repositories for whisperX-FastAPI

Users that are interested in whisperX-FastAPI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tijszwinkels / whisperX-api
View on GitHub
The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…
☆18Aug 24, 2023Updated 2 years ago
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,320Jul 13, 2026Updated 2 weeks ago
speaches-ai / speaches
View on GitHub
☆3,548Updated this week
asaddi / f5-tts-serve
View on GitHub
A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…
☆14Feb 7, 2025Updated last year
jfgonsalves / parakeet-diarized
View on GitHub
Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
☆77Feb 21, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
chinaboard / whisperX-service
View on GitHub
WhisperX Service love docker!
☆18Aug 17, 2024Updated last year
ahmetoner / whisper-asr-webservice
View on GitHub
OpenAI Whisper ASR Webservice API
☆3,308Nov 23, 2025Updated 8 months ago
transcriptionstream / transcriptionstream
View on GitHub
turnkey self-hosted offline transcription and diarization service with llm summary
☆944Jan 18, 2026Updated 6 months ago
matatonic / openedai-whisper
View on GitHub
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
☆91Feb 2, 2025Updated last year
jkin8010 / fastrtc-talking-more
View on GitHub
基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用
☆22Apr 26, 2025Updated last year
phineas-pta / fine-tune-whisper-vi
View on GitHub
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
☆19Aug 15, 2025Updated 11 months ago
AIFSH / ComfyUI-WhisperX
View on GitHub
a comfyui cuatom node for audio subtitling based on whisperX and translators
☆64Apr 1, 2025Updated last year
patientx / F5-TTS-ONNX-gui
View on GitHub
Running the F5-TTS by ONNX Runtime standalone with GUI
☆27Dec 10, 2024Updated last year
linto-ai / linto-studio
View on GitHub
Transcription and annotation interface for recorded audio or video files
☆59Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MahmoudAshraf97 / whisper-diarization
View on GitHub
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,614Feb 23, 2026Updated 5 months ago
collabora / WhisperLive
View on GitHub
A nearly-live implementation of OpenAI's Whisper.
☆4,192Updated this week
ErcinDedeoglu / WhisperDock
View on GitHub
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for ef…
☆31Feb 28, 2026Updated 5 months ago
kristofferv98 / whisper_turboapi
View on GitHub
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX optimization
☆14Jun 5, 2025Updated last year
neverbiasu / ComfyUI-Image-Captioner
View on GitHub
A ComfyUI extension for generating captions of images.
☆29May 12, 2025Updated last year
Dschogo / whisperx-webui
View on GitHub
Transcribe with ease :D
☆16Jun 21, 2023Updated 3 years ago
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
NanoNets / nanonets-id-card-digitization
View on GitHub
Python demo for ID card digitization using Nanonets
☆27Nov 28, 2019Updated 6 years ago
revdotcom / reverb-self-hosted
View on GitHub
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆55Dec 10, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ufal / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,657Nov 12, 2025Updated 8 months ago
ALM-LAB / PACE
View on GitHub
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…
☆17Dec 11, 2022Updated 3 years ago
cq535454518 / Whisper-Whisperx-GUI
View on GitHub
本工具是python tkinter编写的一个简单的Gui，任务批量管理器。通过Gui选项生成*CMD*(command),来调用whisper，达到批量生成，管理的目的。支持whisper和whisperx
☆58Aug 29, 2023Updated 2 years ago
catcto / CosyVoiceDocker
View on GitHub
This repository provides a Docker image for CosyVoice
☆27Dec 22, 2024Updated last year
InterfazeAI / insanely-fast-whisper-api
View on GitHub
An API to transcribe audio with OpenAI's Whisper Large v3!
☆355Nov 13, 2024Updated last year
yinruiqing / pyannote-whisper
View on GitHub
☆676Sep 24, 2025Updated 10 months ago
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,609Nov 19, 2025Updated 8 months ago
Woo-jin-Chung / MF-PAM_mfpam_pitch_estimation_pytorch
View on GitHub
☆16Sep 17, 2025Updated 10 months ago
yuanfangqiao / andas
View on GitHub
Java Go Websocket ESP32 实现视频流图传
☆23Oct 6, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
datobs / react-native-perspective-image-cropper
View on GitHub
Perform custom crop, resizing and perspective correction 📐🖼
☆11May 9, 2025Updated last year
EvilFreelancer / docker-whisper-server
View on GitHub
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
☆31Apr 5, 2026Updated 3 months ago
rmorlok / authproxy
View on GitHub
Authenticating proxy server for connecting to 3rd party APIs
☆17Updated this week
vantezzen / quill-languagetool
View on GitHub
✒️ LanguageTool integration for Quill.js editors
☆17Aug 20, 2024Updated last year
alpertunga-bile / image-caption-comfyui
View on GitHub
Using image caption models to extract prompts in ComfyUI
☆12May 21, 2025Updated last year
mallahyari / RealtimeSTT-TTS
View on GitHub
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆47Nov 29, 2023Updated 2 years ago
RapidAI / RapidSpeech.cpp
View on GitHub
On-device speech AI runtime for ASR, TTS, VAD, and voice cloning. Python-simple, C++-native, GGUF-powered.
☆22Jul 15, 2026Updated 2 weeks ago