LAION-AI / natural_voice_assistant
☆478Updated 9 months ago
Alternatives and similar repositories for natural_voice_assistant:
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
- Joint speech-language model - respond directly to audio!☆366Updated 8 months ago
- Command Your World with Voice☆595Updated 2 months ago
- ☆1,113Updated 2 weeks ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,582Updated 7 months ago
- ☆95Updated 10 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆587Updated 6 months ago
- ☆266Updated 8 months ago
- ☆204Updated 5 months ago
- Whisper with Medusa heads☆823Updated this week
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆784Updated 3 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch☆440Updated 2 weeks ago
- A ggml (C++) re-implementation of tortoise-tts☆175Updated 6 months ago
- A talking LLM that runs on your own computer without needing the internet.☆402Updated 6 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆749Updated 6 months ago
- Interface for OuteTTS models.☆940Updated 2 weeks ago
- ☆253Updated 11 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆656Updated 9 months ago
- A multimodal, function calling powered LLM webui.☆215Updated 5 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆319Updated 2 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆380Updated 2 weeks ago
- ☆314Updated 8 months ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆719Updated last month
- ☆348Updated 6 months ago
- Implementation of F5-TTS in MLX☆489Updated last month
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆362Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆479Updated last year
- ☆199Updated 9 months ago
- A simple FastAPI Server to run XTTSv2☆479Updated 7 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆882Updated 4 months ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆336Updated 4 months ago