coqui-ai / whisperXLinks
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆56Updated last year
Alternatives and similar repositories for whisperX
Users that are interested in whisperX are comparing it to the libraries listed below
Sorting:
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆246Updated 4 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆230Updated 10 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆219Updated 4 months ago
- Recipes for on-device voice AI and local LLM☆103Updated this week
- Shared Voice Interface☆43Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆384Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆284Updated 2 weeks ago
- ☆34Updated last month
- On-device streaming text-to-speech engine powered by deep learning☆127Updated last week
- ☆100Updated last year
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆95Updated last month
- streaming speech to text server using Whisper☆98Updated 2 years ago
- web based editor for subtitles and transcripts☆142Updated last year
- Pinokio System Programming☆35Updated last year
- ☆38Updated 3 years ago
- ☆75Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆128Updated 2 years ago
- Simulates talk with an AI that can express emotions☆83Updated 6 months ago
- faster-whisper as serverless endpoint☆126Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated 2 years ago
- AI writing assistant with voiced narrator and characters and an illustrator☆38Updated 9 months ago
- API server for Instant voice cloning by MyShell.☆106Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆167Updated 2 weeks ago
- Voice models for Mimic 3 text to speech system☆160Updated last year