tijszwinkels / whisperX-apiLinks
The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-to-use endpoint for audio transcription and is packaged into a Docker container for easy deployment
☆15Updated last year
Alternatives and similar repositories for whisperX-api
Users that are interested in whisperX-api are comparing it to the libraries listed below
Sorting:
- FastAPI service on top of WhisperX☆114Updated this week
- Live-Transcription (STT) with Whisper PoC☆187Updated last year
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆330Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆773Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆133Updated last month
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆111Updated last year
- Open source inference code for Rev's model☆412Updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆221Updated 3 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆295Updated 8 months ago
- WhisperX Service love docker!☆15Updated 11 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆332Updated 8 months ago
- ☆607Updated last year
- Real-time transcription using faster-whisper☆474Updated 11 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆892Updated 9 months ago
- Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface☆389Updated this week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆364Updated last year
- Real-time Speech To Text using Faster Whisper.☆57Updated 11 months ago
- ☆2,118Updated this week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆215Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Real-Time Voice Inference Web SDK☆259Updated this week
- A Fast TTS Engine☆526Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- Local SRT/LLM/TTS Voicechat☆698Updated 9 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆171Updated 2 years ago
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…☆382Updated last month
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆217Updated 5 months ago
- A python package to build AI-powered real-time audio applications☆1,367Updated 5 months ago