LAION-AI / natural_voice_assistantLinks
☆491Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆671Updated 2 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- ☆1,134Updated 6 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- ☆206Updated last year
- ☆248Updated 2 months ago
- Whisper with Medusa heads☆852Updated 3 weeks ago
- Command Your World with Voice☆739Updated 2 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆459Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆216Updated 2 weeks ago
- ☆99Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- ☆158Updated 2 years ago
- ☆340Updated last year
- ☆262Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆187Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆784Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- Sesame CSM 1B Voice Cloning☆320Updated 5 months ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆751Updated 5 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆99Updated 2 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 9 months ago
- Implementation of F5-TTS in MLX☆574Updated 5 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆835Updated 9 months ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- Python bindings for whisper.cpp☆282Updated this week
- Interface for OuteTTS models.☆1,365Updated 2 months ago
- 🐮📢 The first AI voice assistant that interrupts *you*☆149Updated 11 months ago