LAION-AI / natural_voice_assistant
☆497Updated last year
Alternatives and similar repositories for natural_voice_assistant
Users who are interested in natural_voice_assistant are comparing it to the libraries listed below
- Joint speech-language model - respond directly to audio!☆371Updated last year
- ☆1,150Updated 11 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- ☆207Updated last year
- llama.cpp with BakLLaVA model describing what it sees☆380Updated 2 years ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆712Updated 7 months ago
- Whisper with Medusa heads☆865Updated 5 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆220Updated 3 weeks ago
- Fine Tune the Style-TTS2 Voice Model☆266Updated 7 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆539Updated last year
- ☆100Updated last year
- Command Your World with Voice☆801Updated 7 months ago
- ☆258Updated last year
- Faster Tortoise inference than Tortoise Fast Fork☆127Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- ☆359Updated last year
- ☆157Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆258Updated 3 years ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- Pybind11 bindings for Whisper.cpp☆344Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆314Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆216Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆535Updated 2 years ago
- ☆275Updated last year
- ☆175Updated 2 years ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆189Updated last year
- An AI assistant beyond the chat box.☆329Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆231Updated 11 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Little AI roleplay program☆61Updated 2 years ago