coqui-ai / whisperXLinks
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆55Updated last year
Alternatives and similar repositories for whisperX
Users that are interested in whisperX are comparing it to the libraries listed below
Sorting:
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆245Updated 3 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Voice models for Mimic 3 text to speech system☆157Updated last year
- ☆101Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆124Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last week
- faster-whisper as serverless endpoint☆125Updated last week
- web based editor for subtitles and transcripts☆141Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- streaming speech to text server using Whisper☆98Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆383Updated last year
- ☆354Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 11 months ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- Speaker diarization model☆28Updated 2 years ago
- Coqui AI TTS plugin☆87Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- ☆100Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆231Updated 9 months ago
- Simulates talk with an AI that can express emotions☆82Updated 5 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆159Updated 2 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- On-device speaker recognition engine powered by deep learning☆38Updated last week
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆328Updated 4 months ago
- Transcription with speaker diarization pipeline☆97Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year