LAION-AI / natural_voice_assistantLinks
☆495Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users that are interested in natural_voice_assistant are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆371Updated last year
- ☆207Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆688Updated 4 months ago
- Command Your World with Voice☆765Updated 4 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 2 months ago
- ☆99Updated last year
- Whisper with Medusa heads☆861Updated 2 months ago
- Fine Tune the Style-TTS2 Voice Model☆254Updated 4 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- ☆1,136Updated 8 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- ☆346Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆158Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆474Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆253Updated 2 years ago
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- ☆261Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,632Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆186Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆520Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆149Updated last year
- Simulates talk with an AI that can express emotions☆80Updated 4 months ago
- A ggml (C++) re-implementation of tortoise-tts☆190Updated last year
- ☆157Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 11 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆306Updated 4 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆316Updated last year
- Realtime demo, Streaming and Finetuning code for CSM☆405Updated last month
- Interface for OuteTTS models.☆1,390Updated 4 months ago