QuentinFuxa / WhisperLiveKit
Simultaneous speech-to-text model
☆8,419 Updated last week
Alternatives and similar repositories for WhisperLiveKit
Users who are interested in WhisperLiveKit are comparing it to the libraries listed below.
- A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that can run completely in your Local … ☆8,254 Updated last week
- Frontier Open-Source Text-to-Speech ☆9,961 Updated 2 months ago
- A free, open-source, and extensible speech-to-text application that works completely offline. ☆6,349 Updated last week
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz… ☆9,657 Updated 2 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆2,924 Updated this week
- The Python library for real-time communication ☆4,392 Updated last month
- LLM agents built for control. Designed for real-world use. Deployed in minutes. ☆16,176 Updated this week
- https://hf.co/hexgrad/Kokoro-82M ☆4,792 Updated 3 months ago
- An Open Source implementation of Notebook LM with more flexibility and features ☆10,009 Updated 2 weeks ago
- The world's first open-source multimodal creative assistant. This is a substitute for Canva and Manus that prioritizes privacy and is usa… ☆5,141 Updated last week
- ☆6,028 Updated 2 months ago
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation ☆4,347 Updated 4 months ago
- Towards Human-Sounding Speech ☆5,729 Updated 6 months ago
- Voice Activity Detector (VAD): low-latency, high-performance, and lightweight ☆1,572 Updated last week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆8,903 Updated 4 months ago
- Generate audiobooks from e-books ☆5,662 Updated 8 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with… ☆5,110 Updated last month
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆5,684 Updated 2 weeks ago
- Zero-shot voice conversion and singing voice conversion, with real-time support ☆3,412 Updated 6 months ago
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth! ☆11,080 Updated last month
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime… ☆8,818 Updated this week
- On-device TTS model by Neuphonic ☆3,965 Updated 2 weeks ago
- Send a phone call from an AI agent, in an API call. Or call the bot directly from the configured phone number! ☆2,397 Updated 3 weeks ago
- ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now,… ☆3,376 Updated 3 weeks ago
- Open source alternative to NotebookLM, Perplexity, and Glean. Connects to search engines, Slack, Linear, Jira, ClickUp, Notion, YouTube, … ☆10,659 Updated this week
- 💖🧸 Self-hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achiev… ☆15,666 Updated this week
- Generate audiobooks from EPUBs, PDFs and text with synchronized captions. ☆3,837 Updated 2 weeks ago
- Local-first AI Notepad for Private Meetings ☆6,624 Updated this week
- ☆2,591 Updated this week
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)" ☆10,320 Updated this week