LAION-AI / natural_voice_assistantLinks
☆494Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆372Updated last year
- ☆207Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆699Updated 5 months ago
- Whisper with Medusa heads☆865Updated 3 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- ☆261Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆381Updated 2 years ago
- Command Your World with Voice☆781Updated 5 months ago
- ☆175Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆191Updated last year
- ☆354Updated last year
- Fine Tune the Style-TTS2 Voice Model☆262Updated 5 months ago
- ☆1,138Updated 9 months ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆334Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆589Updated 2 years ago
- ☆158Updated 2 years ago
- An AI assistant beyond the chat box.☆328Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆315Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆487Updated last year
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch☆1,539Updated 7 months ago
- A multimodal, function calling powered LLM webui.☆217Updated last year
- Pybind11 bindings for Whisper.cpp☆341Updated 11 months ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆760Updated 9 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆848Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- 🐮📢 The first AI voice assistant that interrupts *you*☆148Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,643Updated last year