nkilm / offline-whisperx
Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.
☆33Updated 5 months ago
Alternatives and similar repositories for offline-whisperx:
Users that are interested in offline-whisperx are comparing it to the libraries listed below
- Transcription and diarization (speaker identification)☆31Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆50Updated 2 months ago
- FastAPI service on top of WhisperX☆68Updated 3 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- streaming speech to text server using Whisper☆86Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- Python bindings for whisper.cpp☆221Updated this week
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆84Updated 2 weeks ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆225Updated 2 weeks ago
- I built a Voice Assistant with ChatGPT, Whisper API, Gradio, and TTS APIs☆53Updated last year
- ☆200Updated 4 months ago
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆25Updated last week
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆47Updated last week
- web based editor for subtitles and transcripts☆121Updated 6 months ago
- A curated list of awesome OpenAI's Whisper☆99Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- Open source inference code for Rev's model☆377Updated last month
- ☆94Updated 9 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆360Updated 5 months ago
- ez audio transcription tool with flexible processing and post-processing options☆144Updated last year
- ☆34Updated 4 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆119Updated 2 weeks ago
- ☆80Updated 7 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆122Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 3 months ago
- Site for sharing Bark voices☆48Updated 7 months ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Updated 6 months ago