LAION-AI / natural_voice_assistant
☆490Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users who are interested in natural_voice_assistant are comparing it to the libraries listed below
- Joint speech-language model - respond directly to audio!☆370Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆659Updated last month
- Whisper with Medusa heads☆850Updated this week
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆214Updated last week
- Command Your World with Voice☆737Updated last month
- ☆244Updated last month
- TTS with The Massively Multilingual Speech (MMS) project☆234Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- ☆205Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆449Updated 11 months ago
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away.☆747Updated 5 months ago
- ☆337Updated last year
- ☆1,131Updated 5 months ago
- ☆98Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆780Updated 11 months ago
- llama.cpp with the BakLLaVA model describes what it sees☆382Updated last year
- ☆260Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆310Updated last year
- Implementation of F5-TTS in MLX☆571Updated 4 months ago
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 11 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆836Updated 8 months ago
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆583Updated 2 years ago
- Low-latency AI companion voice talk in 60 lines of code using faster_whisper and ElevenLabs input streaming☆303Updated last month
- An AI assistant beyond the chat box.☆328Updated last year
- first base model for full-duplex conversational audio☆1,747Updated 7 months ago
- ☆158Updated 2 years ago
- Simulates talk with an AI that can express emotions☆77Updated last month
- On-device intelligence.☆367Updated 4 months ago