coqui-ai / whisperXLinks

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

☆50

Alternatives and similar repositories for whisperX

Users that are interested in whisperX are comparing it to the libraries listed below

Sorting:

Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆97Updated 3 weeks ago
shahizat / JetsonGPT
Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech
☆120Updated 2 years ago
lablab-ai / whisper-api-flask
☆100Updated 2 years ago
ancs21 / awesome-openai-whisper
A curated list of awesome OpenAI's Whisper
☆101Updated last year
runpod-workers / worker-faster_whisper
faster-whisper as serverless endpoint
☆108Updated last month
lxe / llm-companion
Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs
☆43Updated last year
OpenVoiceOS / ovos-tts-plugin-piper
☆27Updated last week
Picovoice / pico-cookbook
Recipes for on-device voice AI and local LLM
☆88Updated last month
Picovoice / eagle
On-device speaker recognition engine powered by deep learning
☆37Updated 3 weeks ago
geekodour / wscribe-editor
web based editor for subtitles and transcripts
☆137Updated 11 months ago
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆116Updated 2 years ago
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆363Updated last year
CalvesGEH / VoiceCraftAPI
An API for VoiceCraft.
☆25Updated last year
metavoiceio / MetaVoiceLive
☆73Updated last year
menloresearch / ichigo-demo
☆91Updated 2 months ago
Majdoddin / lexicaps
Transcription and Diarization based on OpenAI's Whisper
☆23Updated last year
geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆155Updated last year
Alireza29675 / whisper-live
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
☆71Updated last year
playht / pyht
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
☆212Updated last month
coqui-ai / xtts-streaming-server
☆336Updated last year
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆220Updated 3 months ago
KoljaB / LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
☆75Updated 3 weeks ago
BBC-Esq / Faster-Whisper-Transcriber
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
☆133Updated 3 weeks ago
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆160Updated last year
rpdrewes / whisper-websocket-server
Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.
☆64Updated last year
Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…
☆260Updated last week
tincans-ai / gazelle
Joint speech-language model - respond directly to audio!
☆371Updated last year
deepgram-starters / flask-live-transcription
Get started using Deepgram's Live Transcription with this Flask demo app
☆35Updated 2 weeks ago
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆52Updated 7 months ago
lucataco / cog-xtts-v2
Cog wrapper for Coqui / xtts-v2
☆75Updated 7 months ago