AIWintermuteAI / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆23Updated 4 months ago
Alternatives and similar repositories for WhisperLive
Users who are interested in WhisperLive are comparing it to the libraries listed below.
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆218Updated 3 months ago
- This Docker image provides a convenient environment for running OpenAI Whisper, a powerful automatic speech recognition (ASR) system.☆86Updated 5 months ago
- API server for Instant voice cloning by MyShell.☆92Updated 7 months ago
- A cutting-edge cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆87Updated last week
- Self-hosted AI voice agent☆102Updated 8 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆114Updated last week
- Simulates talk with an AI that can express emotions☆69Updated 9 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆73Updated last month
- A talking LLM that runs on your own computer without needing the internet.☆462Updated 9 months ago
- On-device streaming text-to-speech engine powered by deep learning☆79Updated last week
- Plug Whisper audio transcription into a local Ollama server and output TTS audio responses☆324Updated last year
- Pybind11 bindings for Whisper.cpp☆57Updated 2 weeks ago
- ☆58Updated 5 months ago
- Clip any moment from any video with prompts☆115Updated 4 months ago
- On-device speaker recognition engine powered by deep learning☆35Updated last week
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆328Updated 3 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆276Updated last week
- Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration, voice cloning, and emotion control. Supports both Transforme…☆35Updated 2 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro☆230Updated last week
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script)☆52Updated 6 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 3 months ago
- ☆75Updated 2 months ago
- G2P☆239Updated 2 weeks ago
- Automatic Speech Recognition Assistant☆34Updated 11 months ago
- Chat with your PDF using your local LLM and the Ollama client (incomplete).☆37Updated 6 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆636Updated 9 months ago
- ☆43Updated 3 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆353Updated 3 weeks ago
- 🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!☆275Updated 2 months ago