speaches-ai / speaches
☆1,582Updated this week
Alternatives and similar repositories for speaches:
Users that are interested in speaches are comparing it to the libraries listed below
- A nearly-live implementation of OpenAI's Whisper.☆2,600Updated last month
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆726Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,633Updated 2 months ago
- Interface for OuteTTS models.☆957Updated last month
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆639Updated 3 months ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆2,112Updated this week
- https://hf.co/hexgrad/Kokoro-82M☆1,825Updated last week
- A python package to build AI-powered real-time audio applications☆1,219Updated last month
- Converts text to speech in realtime☆2,727Updated last week
- OpenAI Whisper ASR Webservice API☆2,453Updated last month
- Local realtime voice AI☆2,264Updated 3 weeks ago
- Whisper with Medusa heads☆821Updated last month
- first base model for full-duplex conversational audio☆1,722Updated 2 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆4,294Updated 2 weeks ago
- Local SRT/LLM/TTS Voicechat☆644Updated 5 months ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆247Updated last week
- TTS with kokoro and onnx runtime☆1,809Updated 3 weeks ago
- A Fast TTS Engine☆472Updated 2 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆376Updated 7 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆833Updated 5 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆1,140Updated 2 weeks ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆312Updated 2 weeks ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,619Updated last week
- Real-time, Fully Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface☆159Updated this week
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech mode…☆943Updated 4 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆926Updated last month
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs☆618Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,588Updated 7 months ago
- turnkey self-hosted offline transcription and diarization service with llm summary☆825Updated 6 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆996Updated last month