coqui-ai / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆36 Updated 4 months ago
Related projects
Alternatives and complementary repositories for whisperX
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆36 Updated 10 months ago
- An API for VoiceCraft. ☆26 Updated 4 months ago
- Modern desktop application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆43 Updated 3 months ago
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script) ☆47 Updated last month
- Pybind11 bindings for Whisper.cpp ☆45 Updated 3 weeks ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆63 Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆32 Updated 2 weeks ago
- A UI for the Piper TTS ☆67 Updated 2 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆21 Updated last month
- On-device streaming text-to-speech engine powered by deep learning ☆56 Updated 2 weeks ago
- Rivet plugin for integration with Ollama, the tool for easily running LLMs locally ☆34 Updated 7 months ago
- Web-based editor for subtitles and transcripts ☆112 Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 Updated 6 months ago
- Text generation in Python, as easy as possible ☆42 Updated this week
- A Qt GUI for large language models ☆40 Updated last year
- Site for sharing Bark voices ☆48 Updated 4 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆50 Updated 6 months ago
- Utility library to work with character cards and roleplay AI in general ☆23 Updated last year
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google. ☆53 Updated 10 months ago
- https://narrateit.streamlit.app/ ☆32 Updated 5 months ago
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated 7 months ago
- 100% free, local & offline voice assistant with speech recognition ☆58 Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆138 Updated 4 months ago