LAION-AI / natural_voice_assistant
☆482Updated 10 months ago
Alternatives and similar repositories for natural_voice_assistant:
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
- Whisper with Medusa heads☆830Updated last month
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆758Updated 8 months ago
- Command Your World with Voice☆642Updated 4 months ago
- ☆216Updated 3 weeks ago
- Joint speech-language model - respond directly to audio!☆368Updated 9 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆606Updated 8 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,591Updated 8 months ago
- ☆254Updated last year
- ☆95Updated 11 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆387Updated 7 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated 8 months ago
- ☆1,123Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆796Updated 4 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆647Updated 6 months ago
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 7 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆231Updated 3 weeks ago
- Interface for OuteTTS models.☆1,111Updated this week
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 11 months ago
- Collection of Open Source Speech Data☆153Updated 5 months ago
- ☆156Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch☆462Updated last month
- ☆354Updated 7 months ago
- Fast TorToiSe inference (5x or your money back!)☆807Updated 9 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆186Updated last month
- ☆269Updated 10 months ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆543Updated 4 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- An AI assistant beyond the chat box.☆325Updated last year
- G2P☆202Updated last week
- ☆633Updated last week