LAION-AI / natural_voice_assistant
☆488Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users who are interested in natural_voice_assistant are comparing it to the libraries listed below.
- Joint speech-language model - respond directly to audio!☆369Updated 11 months ago
- ☆235Updated last week
- ☆1,134Updated 4 months ago
- Whisper with Medusa heads☆842Updated 3 weeks ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.☆1,613Updated 10 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆647Updated last week
- Command Your World with Voice☆709Updated last week
- A ggml (C++) re-implementation of tortoise-tts☆186Updated 10 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated last year
- Interface for OuteTTS models.☆1,318Updated this week
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆656Updated 8 months ago
- ☆258Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 11 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆582Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch☆1,517Updated 2 months ago
- llama.cpp with the BakLLaVA model describes what it sees☆383Updated last year
- An AI assistant beyond the chat box.☆328Updated last year
- Controllable and fast Text-to-Speech for over 7000 languages!☆1,617Updated last month
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆826Updated 7 months ago
- Implementation of F5-TTS in MLX☆554Updated 3 months ago
- ☆270Updated last year
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- First base model for full-duplex conversational audio☆1,749Updated 5 months ago
- The code for the bark-voicecloning model. Training and inference.☆703Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆433Updated 9 months ago
- ☆97Updated last year
- Fast TorToiSe inference (5x or your money back!)☆827Updated 11 months ago
- Official Implementation of StyleTTS☆435Updated 5 months ago
- ☆205Updated last year