lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆534 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for voicechat2
- ☆442 · Updated this week
- Example UI implementing the RTVI web client ☆471 · Updated 3 weeks ago
- Whisper with Medusa heads ☆800 · Updated last week
- first base model for full-duplex conversational audio ☆1,146 · Updated this week
- Open source inference code for Rev's model ☆326 · Updated last week
- ⚡ Insanely fast AI voice assistant with <500ms response times ☆298 · Updated 2 months ago
- Interface for OuteTTS models. ☆277 · Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ☆682 · Updated this week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆753 · Updated last week
- Have a natural voice conversation with an LLM ☆222 · Updated this week
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis. ☆940 · Updated 2 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆714 · Updated 2 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice. ☆358 · Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,541 · Updated 3 months ago
- Implementation of F5-TTS in MLX ☆309 · Updated last week
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech,… ☆304 · Updated this week
- Llama3.1 learns to Listen ☆1,702 · Updated this week
- podcastfy.ai gradio demo app ☆304 · Updated last week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆453 · Updated 2 months ago
- ☆711 · Updated this week
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. ☆1,201 · Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆249 · Updated 2 months ago
- ☆294 · Updated 4 months ago
- Voice Transformation for Videos. 🎤👄🎬 ☆216 · Updated 3 weeks ago
- ☆283 · Updated 3 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆503 · Updated 2 months ago
- Yet another open source Perplexity ☆361 · Updated 2 weeks ago
- A fast multimodal LLM for real-time voice ☆973 · Updated this week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co… ☆950 · Updated this week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals. ☆531 · Updated this week