Simultaneous speech-to-text models
☆9,918Mar 13, 2026Updated this week
Alternatives and similar repositories for WhisperLiveKit
Users who are interested in WhisperLiveKit are comparing it to the libraries listed below
- SOTA Open Source TTS☆27,364Updated this week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…☆9,551Updated this week
- ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-p…☆14,577Updated this week
- Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization bui…☆10,314Mar 3, 2026Updated last week
- Open source alternative to NotebookLM for teams. Join our Discord: https://discord.gg/ejRNvftDp9☆13,246Updated this week
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz…☆10,546Sep 12, 2025Updated 6 months ago
- Faster Whisper transcription with CTranslate2☆21,443Nov 19, 2025Updated 3 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆61,687Mar 9, 2026Updated last week
- Python tool for converting files and office documents to Markdown.☆90,728Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆26,992Updated this week
- Build, run, manage agentic software at scale.☆38,700Updated this week
- The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling cus…☆17,819Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al…☆17,168Mar 9, 2026Updated last week
- Open-source framework for conversational voice AI agents☆10,249Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,094Mar 7, 2026Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆28,006Sep 30, 2025Updated 5 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,193Mar 8, 2026Updated last week
- Kortix – build, manage and train AI Agents.☆19,497Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost …☆25,499Mar 2, 2026Updated last week
- The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI☆24,872Feb 27, 2026Updated 2 weeks ago
- Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!☆19,883Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆93,251Updated this week
- Open-Source Frontier Voice AI☆23,713Mar 6, 2026Updated last week
- All-in-one open-source whiteboard tool (SaaS) with mind maps, flowcharts, freehand drawing, and more.☆13,295Mar 3, 2026Updated last week
- SoTA open-source TTS☆23,211Mar 9, 2026Updated last week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆6,239Updated this week
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆10,798Updated this week
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆33,400Mar 6, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,431Mar 1, 2026Updated 2 weeks ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,139Feb 28, 2026Updated 2 weeks ago
- Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process depl…☆9,708Feb 8, 2026Updated last month
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆55,756Mar 7, 2026Updated last week
- AI notepad for meetings☆7,944Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,085Apr 19, 2025Updated 10 months ago
- Vane is an AI-powered answering engine.☆32,660Updated this week
- real time face swap and one-click video deepfake with only a single image☆79,950Mar 6, 2026Updated last week
- The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne…☆73,318Mar 9, 2026Updated last week
- The python library for real-time communication☆4,548Jan 12, 2026Updated 2 months ago
- Universal memory layer for AI Agents☆49,365Updated this week