LAION-AI / natural_voice_assistant
☆476Updated 8 months ago
Alternatives and similar repositories for natural_voice_assistant:
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
- Joint speech-language model - respond directly to audio!☆365Updated 6 months ago
- ☆1,108Updated 7 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆564Updated 5 months ago
- Command Your World with Voice☆560Updated last month
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,575Updated 5 months ago
- ☆255Updated 10 months ago
- Whisper with Medusa heads☆819Updated 3 weeks ago
- ☆195Updated 3 months ago
- Interface for OuteTTS models.☆899Updated last week
- A multimodal, function calling powered LLM webui.☆213Updated 4 months ago
- ☆196Updated 8 months ago
- llama.cpp with BakLLaVA model describes what does it see☆380Updated last year
- ☆269Updated 7 months ago
- 🐮📢 The first AI voice assistant that interrupts *you*☆138Updated 4 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆471Updated last year
- Python bindings for whisper.cpp☆210Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 6 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆352Updated 5 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆238Updated 2 years ago
- ☆153Updated last year
- ☆90Updated 9 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆189Updated 4 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆766Updated 2 months ago
- function calling-based LLM agents☆283Updated 4 months ago
- Implementation of F5-TTS in MLX☆448Updated last week
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆300Updated last month
- Local semantic search. Stupidly simple.☆407Updated 6 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆186Updated 3 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆161Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆175Updated 5 months ago