LAION-AI / natural_voice_assistant
☆461 Updated 5 months ago
Related projects
Alternatives and complementary repositories for natural_voice_assistant
- Joint speech-language model - respond directly to audio! ☆356 Updated 4 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆518 Updated 3 months ago
- ☆1,094 Updated 5 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆718 Updated 3 months ago
- Command Your World with Voice ☆443 Updated this week
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆730 Updated this week
- Interface for OuteTTS models. ☆406 Updated 2 weeks ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆444 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆138 Updated 4 months ago
- ☆253 Updated 8 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆159 Updated 3 months ago
- Whisper with Medusa heads ☆799 Updated 3 weeks ago
- first base model for full-duplex conversational audio ☆1,560 Updated last week
- ☆152 Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆322 Updated 5 months ago
- ☆273 Updated 3 months ago
- Performant and accurate speech recognition built on Pytorch ☆248 Updated 2 years ago
- Collection of Open Source Speech Data ☆146 Updated last week
- ☆253 Updated 5 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆157 Updated last month
- ☆176 Updated last month
- ☆87 Updated 6 months ago
- Implementation of F5-TTS in MLX ☆327 Updated 2 weeks ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch ☆614 Updated last month
- A fast multimodal LLM for real-time voice ☆1,339 Updated this week
- ☆307 Updated 2 months ago
- llama.cpp with BakLLaVA model describes what it sees ☆380 Updated last year
- ☆256 Updated 5 months ago
- A multimodal, function calling powered LLM webui. ☆208 Updated last month
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,547 Updated 3 months ago