KoljaB / LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
☆555Updated 5 months ago
Alternatives and similar repositories for LocalAIVoiceChat:
Users that are interested in LocalAIVoiceChat are comparing it to the libraries listed below
- Command Your World with Voice☆506Updated last month
- Webui for using XTTS and for finetuning it☆710Updated 3 months ago
- A simple FastAPI Server to run XTTSv2☆447Updated 5 months ago
- Local SRT/LLM/TTS Voicechat☆590Updated 3 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆268Updated 7 months ago
- Slightly improved official version for finetune xtts☆289Updated 2 months ago
- ☆308Updated 6 months ago
- A talking LLM that runs on your own computer without needing the internet.☆347Updated 5 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆298Updated last month
- ☆90Updated 8 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆622Updated 4 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,320Updated last week
- API server for Instant voice cloning by MyShell.☆80Updated 3 months ago
- ☆1,110Updated this week
- plug whisper audio transcription to a local ollama server and ouput tts audio responses☆264Updated last year
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆343Updated last year
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech mode…☆866Updated 2 months ago
- Interface for OuteTTS models.☆859Updated this week
- Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.☆839Updated last week
- Converts text to speech in realtime☆2,287Updated this week
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆188Updated 2 months ago
- ☆1,106Updated 6 months ago
- The code for the bark-voicecloning model. Training and inference.☆681Updated last year
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆686Updated last month
- A Gradio UI for XTTSv2 and RVC.☆156Updated 7 months ago
- ☆189Updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆174Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆331Updated 7 months ago
- Live-Transcription (STT) with Whisper PoC☆165Updated 6 months ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆632Updated this week