coqui-ai / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆45Updated 7 months ago
Alternatives and similar repositories for whisperX:
Users that are interested in whisperX are comparing it to the libraries listed below
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆41Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- An API for VoiceCraft.☆26Updated 7 months ago
- On-device speaker recognition engine powered by deep learning☆32Updated this week
- A UI for the Piper TTS☆79Updated 5 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆40Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- A QT GUI for large language models☆30Updated last year
- ☆23Updated 2 weeks ago
- Get started using Deepgram's Live Transcription with this Flask demo app☆28Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆49Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆70Updated this week
- Self-hosted AI voice agent☆86Updated 5 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆109Updated last year
- The code for some apps built with Sieve.☆74Updated 2 months ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆76Updated 9 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆37Updated 5 months ago
- Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built with Next.js and Tailwind CSS.…☆14Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆17Updated 6 months ago
- ☆94Updated 9 months ago
- Something similar to Apple Intelligence?☆59Updated 7 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆21Updated last year
- G2P☆119Updated this week
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆27Updated 6 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 3 months ago
- Pybind11 bindings for Whisper.cpp☆50Updated 2 weeks ago
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- ☆198Updated 8 months ago