LAION-AI / natural_voice_assistant
☆483 Updated 11 months ago
Alternatives and similar repositories for natural_voice_assistant:
Users who are interested in natural_voice_assistant are comparing it to the libraries listed below:
- Joint speech-language model - respond directly to audio! ☆369 Updated 10 months ago
- ☆1,126 Updated 2 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,598 Updated 9 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆763 Updated 8 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆495 Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆628 Updated 8 months ago
- ☆223 Updated last month
- Command Your World with Voice ☆659 Updated 4 months ago
- ☆255 Updated last year
- Whisper with Medusa heads ☆832 Updated last week
- ☆96 Updated last year
- ☆285 Updated 10 months ago
- ☆353 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 Updated 9 months ago
- ☆326 Updated 10 months ago
- Python bindings for whisper.cpp ☆246 Updated 2 weeks ago
- ☆204 Updated 11 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines ☆402 Updated 8 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in PyTorch ☆650 Updated 7 months ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated last year
- first base model for full-duplex conversational audio ☆1,737 Updated 4 months ago
- Interface for OuteTTS models. ☆1,205 Updated last week
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX. ☆696 Updated 11 months ago
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away. ☆736 Updated 2 months ago
- ☆269 Updated 10 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆189 Updated 2 months ago
- Open source inference code for Rev's model ☆402 Updated last week
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance. ☆581 Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆178 Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆209 Updated 6 months ago