Simultaneous speech-to-text models
☆10,490 · Jun 12, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for WhisperLiveKit
Users who are interested in WhisperLiveKit are comparing it to the libraries listed below.
- Whisper realtime streaming for long speech-to-text transcription and translation ☆3,644 · Nov 12, 2025 · Updated 7 months ago
- GenBI (Generative BI) for AI agents, an open-source, governed text-to-SQL through an open context layer that turns natural-language quest… ☆15,669 · Updated this week
- SOTA Open Source TTS ☆30,996 · Jun 9, 2026 · Updated 3 weeks ago
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz… ☆11,056 · Sep 12, 2025 · Updated 9 months ago
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆9,938 · Jun 12, 2026 · Updated 3 weeks ago
- Faster Whisper transcription with CTranslate2 ☆23,840 · Nov 19, 2025 · Updated 7 months ago
- Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization bui… ☆12,966 · Jun 5, 2026 · Updated 3 weeks ago
- An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9 ☆15,115 · Jun 26, 2026 · Updated last week
- Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictab… ☆18,143 · Jun 24, 2026 · Updated last week
- Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it! ☆20,742 · Jun 26, 2026 · Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆70,185 · Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce. ☆28,866 · Jun 26, 2026 · Updated last week
- Python tool for converting files and office documents to Markdown. ☆159,614 · Jun 24, 2026 · Updated last week
- SoTA open-source TTS ☆25,271 · Jun 10, 2026 · Updated 3 weeks ago
- A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone ☆25,764 · Jun 25, 2026 · Updated last week
- Open-source whiteboard tool (SaaS): an all-in-one whiteboard with mind maps, flowcharts, freehand drawing, and more. ☆14,115 · Jun 24, 2026 · Updated last week
- Open source Granola AI Alternative ☆8,722 · Jun 26, 2026 · Updated last week
- The Company AI Command Center ☆19,900 · Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost … ☆26,556 · Jun 10, 2026 · Updated 3 weeks ago
- Open-Source Frontier Voice AI ☆49,759 · May 6, 2026 · Updated last month
- Build, run, and manage agent platforms. ☆40,861 · Jun 26, 2026 · Updated last week
- The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI ☆26,073 · Jun 9, 2026 · Updated 3 weeks ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆29,534 · Sep 30, 2025 · Updated 9 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆22,716 · Jun 26, 2026 · Updated last week
- Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-… ☆18,699 · Updated this week
- YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure. Connect to … ☆19,529 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆25,500 · Jun 9, 2026 · Updated 3 weeks ago
- Open-source framework for conversational voice AI agents ☆10,768 · Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆7,431 · Jun 25, 2026 · Updated last week
- Vane is an AI-powered answering engine. ☆35,489 · Apr 11, 2026 · Updated 2 months ago
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows. ☆71,142 · Updated this week
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime… ☆13,210 · Jun 25, 2026 · Updated last week
- Real-time face swap and one-click video deepfake with only a single image ☆94,434 · Jun 24, 2026 · Updated last week
- AI video translation & dubbing tool for humans and AI Agents, powered by LLMs. Full pipeline: download, transcribe, translate, TTS dub, r… ☆10,409 · Jun 25, 2026 · Updated last week
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. … ☆35,343 · Jun 24, 2026 · Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆21,010 · Jun 13, 2026 · Updated 2 weeks ago
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆36,789 · Apr 19, 2025 · Updated last year
- The best way to get AI coding agents to solve hard problems in complex codebases. ☆11,070 · Jun 19, 2026 · Updated 2 weeks ago
- The API to search, scrape, and interact with the web at scale. 🔥 ☆140,107 · Updated this week