coqui-ai / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆56Updated last year
Alternatives and similar repositories for whisperX
Users who are interested in whisperX compare it to the libraries listed below
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆121Updated last week
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆245Updated 4 months ago
- ☆33Updated 3 weeks ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆334Updated 5 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆126Updated 2 years ago
- Coqui AI TTS plugin☆85Updated 5 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- On-device speaker recognition engine powered by deep learning☆38Updated last week
- faster-whisper as serverless endpoint☆125Updated 3 weeks ago
- Simulates talk with an AI that can express emotions☆82Updated 6 months ago
- Voice models for Mimic 3 text to speech system☆160Updated last year
- This is an interface that converts any PDF document you give it, offline, into an interview between two people discussing it.☆16Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆24Updated 3 months ago
- ☆100Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆278Updated last week
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆127Updated last year
- ☆75Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆383Updated last year
- web based editor for subtitles and transcripts☆142Updated last year
- A UI for the Piper TTS☆106Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆162Updated this week
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- API server for Instant voice cloning by MyShell.☆106Updated last year
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Text-to-speech Chrome extension that works with local OpenAI-compatible TTS models☆29Updated 9 months ago
- Link your Ollama models to LM-Studio☆148Updated last year