KoljaB / LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
☆712Updated 7 months ago
Alternatives and similar repositories for LocalAIVoiceChat
Users who are interested in LocalAIVoiceChat are comparing it to the libraries listed below.
- Command Your World with Voice☆801Updated 7 months ago
- A simple FastAPI Server to run XTTSv2☆571Updated last year
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away.☆763Updated 11 months ago
- Webui for using XTTS and for finetuning it☆867Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆385Updated last year
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆314Updated 7 months ago
- Simulates talk with an AI that can express emotions☆82Updated 7 months ago
- Local STT/LLM/TTS Voicechat☆750Updated last year
- Plug Whisper audio transcription into a local Ollama server and output TTS audio responses☆367Updated 3 months ago
- A talking LLM that runs on your own computer without needing the internet.☆774Updated 3 months ago
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech mode…☆1,105Updated 2 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆849Updated last year
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆355Updated 6 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,232Updated 3 weeks ago
- ☆497Updated last year
- Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.☆1,183Updated 6 months ago
- ☆359Updated last year
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆343Updated 8 months ago
- Interface for OuteTTS models.☆1,421Updated 7 months ago
- Plugin that lets you ask questions about your documents including audio and video files.☆360Updated this week
- Run Orpheus 3B Locally With LM Studio☆510Updated 10 months ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆307Updated last week
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆245Updated last year
- API server for Instant voice cloning by MyShell.☆107Updated last year
- ☆784Updated 7 months ago
- Slightly improved official version for finetune xtts☆382Updated 10 months ago
- ☆522Updated 2 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs or Kokoro☆380Updated last week
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆431Updated 4 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆180Updated 2 years ago