KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆2,139Updated last week
Alternatives and similar repositories for RealtimeVoiceChat
Users that are interested in RealtimeVoiceChat are comparing it to the libraries listed below
Sorting:
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…☆1,006Updated last month
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,110Updated last week
- Local realtime voice AI☆2,290Updated 2 months ago
- A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption☆2,059Updated this week
- Towards Human-Sounding Speech☆4,750Updated last week
- ☆721Updated 3 weeks ago
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,266Updated this week
- first base model for full-duplex conversational audio☆1,741Updated 4 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆845Updated 2 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.☆491Updated last week
- Browser-LLM Auto-Scaling Technology☆507Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆765Updated 9 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,124Updated this week
- Minimal AI agent framework that just works with only seven tools.☆448Updated this week
- Artificial Neural Engine Machine Learning Library☆914Updated this week
- Onit MacOS client☆811Updated this week
- Browser MCP is a Model Context Provider (MCP) server that allows AI applications to control your browser☆1,475Updated 3 weeks ago
- Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterativ…☆2,653Updated this week
- Realtime AI speech with OpenAI Realtime API on Arduino ESP32 with Secure Websockets and Deno edge functions with >10min uninterrupted con…☆960Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,701Updated this week
- I made my AI think harder by making it argue with itself repeatedly. It works stupidly well.☆2,059Updated 2 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,918Updated 3 weeks ago
- A real-time silent speech recognition tool.☆493Updated 3 months ago
- Open Source framework for voice and multimodal conversational AI☆6,065Updated this week
- Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.☆591Updated 2 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…☆3,668Updated last week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,030Updated last month
- A self-hosted API that takes a URL and returns a file with browser screenshots.☆970Updated 2 months ago
- Enable AI models for video production in the browser☆1,637Updated last month
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆637Updated 2 weeks ago