tijszwinkels / whisperX-apiLinks

The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-to-use endpoint for audio transcription and is packaged into a Docker container for easy deployment

☆15

Alternatives and similar repositories for whisperX-api

Users that are interested in whisperX-api are comparing it to the libraries listed below

Sorting:

pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆114Updated this week
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆187Updated last year
jim60105 / docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …
☆330Updated this week
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆773Updated last month
BBC-Esq / Faster-Whisper-Transcriber
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
☆133Updated last month
yohasebe / whisper-stream
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
☆111Updated last year
revdotcom / reverb
Open source inference code for Rev's model
☆412Updated 3 months ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆221Updated 3 months ago
JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆295Updated 8 months ago
chinaboard / whisperX-service
WhisperX Service love docker!
☆15Updated 11 months ago
alexgo84 / whisperx-server
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆12Updated 2 years ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆332Updated 8 months ago
yinruiqing / pyannote-whisper
☆607Updated last year
reriiasu / speech-to-text
Real-time transcription using faster-whisper
☆474Updated 11 months ago
alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆892Updated 9 months ago
QuentinFuxa / WhisperLiveKit
Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface
☆389Updated this week
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆364Updated last year
rudymohammadbali / Real-time-STT
Real-time Speech To Text using Faster Whisper.
☆57Updated 11 months ago
speaches-ai / speaches
☆2,118Updated this week
Wordcab / wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆215Updated 8 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96Updated last year
pipecat-ai / pipecat-client-web
Real-Time Voice Inference Web SDK
☆259Updated this week
astramind-ai / Auralis
A Fast TTS Engine
☆526Updated 5 months ago
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆120Updated last year
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆698Updated 9 months ago
mldljyh / whisper_real_time_translation
The subtitles and translations are generated in real-time and displayed as pop-ups.
☆171Updated 2 years ago
echogarden-project / echogarden
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…
☆382Updated last month
awexandrr / audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
☆119Updated last year
thomasmol / cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
☆217Updated 5 months ago
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,367Updated 5 months ago