QuentinFuxa / WhisperLiveKitLinks
Simultaneous speech-to-text model
☆8,942Updated last week
Alternatives and similar repositories for WhisperLiveKit
Users that are interested in WhisperLiveKit are comparing it to the libraries listed below
Sorting:
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz…☆9,842Updated 2 months ago
- Open-Source Frontier Voice AI☆11,186Updated this week
- A free, open source, and extensible speech-to-text application that works completely offline.☆7,646Updated last week
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,291Updated 3 weeks ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆9,116Updated this week
- 💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achiev…☆15,936Updated this week
- The python library for real-time communication☆4,432Updated last week
- State-of-the-art TTS model under 25MB 😻☆9,218Updated 3 months ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆2,221Updated 2 months ago
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆4,368Updated 5 months ago
- ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now,…☆3,435Updated last month
- A research prototype of a human-centered web agent☆8,367Updated last week
- Local-first AI Notepad for Private Meetings☆7,010Updated this week
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.☆7,783Updated this week
- 🔥 基于大模型和 RAG 的智能问数系统,对话式数据分析神器。Text-to-SQL Generation via LLMs using RAG.☆4,803Updated this week
- Generate code from the terminal!☆2,599Updated this week
- On-device TTS model by Neuphonic☆4,167Updated 2 weeks ago
- zero-shot voice conversion & singing voice conversion, with real-time support☆3,451Updated 7 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,979Updated last week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,402Updated 2 weeks ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…☆5,182Updated 2 months ago
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆23,976Updated 3 weeks ago
- ☆6,040Updated 3 months ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,840Updated last month
- Super Magic. The first open-source all-in-one AI productivity platform (Generalist AI Agent + Workflow Engine + IM + Online collaborative…☆4,369Updated this week
- Voice Activity Detector (VAD) : low-latency, high-performance and lightweight☆1,648Updated this week
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆16,621Updated this week
- https://hf.co/hexgrad/Kokoro-82M☆4,996Updated 4 months ago
- SoTA open-source TTS☆14,907Updated 2 months ago
- Free, local, open-source AI app builder ✨ v0 / lovable / Bolt alternative 🌟 Star if you like it!☆17,746Updated this week