coqui-ai / whisperXLinks
WhisperX:  Automatic Speech Recognition with Word-level Timestamps (& Diarization)
β54Updated last year
Alternatives and similar repositories for whisperX
Users that are interested in whisperX are comparing it to the libraries listed below
Sorting:
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- β91Updated 5 months ago
- β31Updated last month
- AI writing assistant with voiced narrator and characters and an illustratorβ38Updated 7 months ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acceβ¦β119Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- Simulates talk with an AI that can express emotionsβ80Updated 4 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β58Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β238Updated 2 months ago
- Voice models for Mimic 3 text to speech systemβ154Updated last year
- Site for sharing Bark voicesβ51Updated 7 months ago
- β99Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speechβ126Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β54Updated 10 months ago
- On-device streaming text-to-speech engine powered by deep learningβ122Updated last month
- faster-whisper as serverless endpointβ121Updated 5 months ago
- β74Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β306Updated 3 months ago
- π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model barkβ68Updated 3 months ago
- web based editor for subtitles and transcriptsβ141Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo appβ40Updated this week
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIsβ43Updated last year
- β101Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β67Updated last year
- Coqui AI TTS pluginβ87Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ379Updated last year
- API server for Instant voice cloning by MyShell.β104Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning APIβ217Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ159Updated last year
- On-device speaker recognition engine powered by deep learningβ37Updated 2 months ago