coqui-ai / whisperXLinks
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆50Updated last year
Alternatives and similar repositories for whisperX
Users that are interested in whisperX are comparing it to the libraries listed below
Sorting:
- On-device streaming text-to-speech engine powered by deep learning☆97Updated 3 weeks ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆120Updated 2 years ago
- ☆100Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- faster-whisper as serverless endpoint☆108Updated last month
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- ☆27Updated last week
- Recipes for on-device voice AI and local LLM☆88Updated last month
- On-device speaker recognition engine powered by deep learning☆37Updated 3 weeks ago
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆116Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆363Updated last year
- An API for VoiceCraft.☆25Updated last year
- ☆73Updated last year
- ☆91Updated 2 months ago
- Transcription and Diarization based on OpenAI's Whisper☆23Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆155Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆71Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆212Updated last month
- ☆336Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆220Updated 3 months ago
- Simulates talk with an AI that can express emotions☆75Updated 3 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆133Updated 3 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆260Updated last week
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆35Updated 2 weeks ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 7 months ago
- Cog wrapper for Coqui / xtts-v2☆75Updated 7 months ago