KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆2,764 Updated last week
Alternatives and similar repositories for RealtimeVoiceChat
Users who are interested in RealtimeVoiceChat are comparing it to the libraries listed below.
- BrowserOS is an open-source agentic web browser. ☆1,987 Updated this week
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f… ☆1,059 Updated last month
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆1,884 Updated last week
- ☆787 Updated this week
- Local realtime voice AI ☆2,336 Updated 4 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆2,302 Updated last month
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ☆854 Updated 4 months ago
- first base model for full-duplex conversational audio ☆1,746 Updated 6 months ago
- The most accurate document search and store for building AI apps ☆2,799 Updated this week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆775 Updated 11 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models. ☆577 Updated last week
- Towards Human-Sounding Speech ☆5,229 Updated 2 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with… ☆4,304 Updated last month
- SoTA open-source TTS ☆9,357 Updated last month
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,793 Updated 2 months ago
- Realtime AI speech with OpenAI Realtime API and Gemini Live API on Arduino ESP32 with Secure Websockets and Deno edge functions with >15 … ☆1,067 Updated this week
- A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption ☆2,234 Updated this week
- ☆2,083 Updated this week
- https://hf.co/hexgrad/Kokoro-82M ☆3,577 Updated last week
- A real-time silent speech recognition tool. ☆518 Updated 5 months ago
- AI-powered multi-agent builder ☆3,387 Updated this week
- 🐝 AI-powered browser assistant ("Cline for web browsing") ☆788 Updated last month
- A self-hosted API that takes a URL and returns a file with browser screenshots. ☆985 Updated 4 months ago
- Open Source framework for voice and multimodal conversational AI ☆6,805 Updated this week
- Enable AI models for video production in the browser ☆1,901 Updated last month
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits f… ☆1,228 Updated 3 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,957 Updated last month
- Kanban board to manage your AI coding agents ☆486 Updated this week
- Backlog.md - A tool for managing project collaboration between humans and AI Agents in a git ecosystem ☆2,200 Updated this week
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆657 Updated last month