jim60105 / docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

☆340

Alternatives and similar repositories for docker-whisperX

Users who are interested in docker-whisperX are comparing it to the repositories listed below.

NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆223 Updated 3 months ago
BBC-Esq / Faster-Whisper-Transcriber
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
☆136 Updated 2 weeks ago
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
☆449 Updated 11 months ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆334 Updated 8 months ago
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆791 Updated 2 months ago
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆364 Updated last year
speaches-ai / speaches
☆2,179 Updated this week
pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆120 Updated last week
QuentinFuxa / WhisperLiveKit
Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface
☆420 Updated this week
yinruiqing / pyannote-whisper
☆615 Updated last year
absadiki / pywhispercpp
Python bindings for whisper.cpp
☆278 Updated last month
JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆296 Updated 8 months ago
lablab-ai / Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAI's Whisper to transcribe and diarize audio files
☆349 Updated 2 years ago
JSchmie / ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆52 Updated 6 months ago
Softcatala / whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,074 Updated 2 months ago
Wordcab / wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆215 Updated 9 months ago
KoljaB / Linguflex
Command Your World with Voice
☆737 Updated last month
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆189 Updated last year
Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…
☆271 Updated 3 weeks ago
matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆797 Updated 6 months ago
thomasmol / cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
☆218 Updated 5 months ago
alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆898 Updated 10 months ago
Majdoddin / nlp
☆486 Updated last year
KoljaB / LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…
☆659 Updated last month
Sharrnah / whispering
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…
☆456 Updated last week
themanyone / whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
☆256 Updated last month
mldljyh / whisper_real_time_translation
The subtitles and translations are generated in real-time and displayed as pop-ups.
☆171 Updated 2 years ago
davabase / transcriber_app
Real-time speech-to-text transcription app.
☆420 Updated 2 years ago
NeuralVox / StyleTTS2
☆98 Updated last year
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,387 Updated 5 months ago