ML-powered speech recognition directly in your browser
β3,253Oct 1, 2024Updated last year
Alternatives and similar repositories for whisper-web
Users that are interested in whisper-web are comparing it to the libraries listed below
Sorting:
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,518Updated this week
- High-performance In-browser LLM Inference Engineβ17,515Mar 2, 2026Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,556Feb 22, 2026Updated 2 weeks ago
- Faster Whisper transcription with CTranslate2β21,289Nov 19, 2025Updated 3 months ago
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, alβ¦β17,168Updated this week
- β8,826Oct 25, 2025Updated 4 months ago
- Port of OpenAI's Whisper model in C/C++β47,262Updated this week
- A fast multimodal LLM for real-time voiceβ4,368Dec 12, 2025Updated 2 months ago
- SOTA Open Source TTSβ25,154Updated this week
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,804Feb 27, 2024Updated 2 years ago
- An open-source RAG-based tool for chatting with your documents.β25,193Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.β40,860Updated this week
- On-device Speech Recognition for Apple Siliconβ5,731Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β10,096May 8, 2025Updated 10 months ago
- An AI-powered search engine with a generative UIβ8,636Updated this week
- Open source Claude Artifacts β built with Llama 3.1 405Bβ6,886Mar 2, 2026Updated last week
- Perplexity Inspired Answer Engineβ5,016Jun 27, 2025Updated 8 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.β36,049Apr 19, 2025Updated 10 months ago
- A framework for building realtime voice AI agents π€ποΈπΉβ9,562Updated this week
- Build AI Agents, Visuallyβ50,539Updated this week
- Perplexica is an AI-powered answering engine.β30,120Feb 13, 2026Updated 3 weeks ago
- Open-source Next.js template for building apps that are fully generated by AI. By E2B.β6,194Feb 28, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,392Mar 1, 2026Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,985Sep 30, 2025Updated 5 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. β¦β11,492Feb 10, 2026Updated 3 weeks ago
- Build, run, manage agentic software at scale.β38,516Updated this week
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ89,344Updated this week
- The Frontend for Agents & Generative UI. React + Angularβ29,208Updated this week
- Inference and training library for high-quality TTS models.β5,547Dec 10, 2024Updated last year
- π Text-Prompted Generative Audio Modelβ39,039Aug 19, 2024Updated last year
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ61,332Updated this week
- Open Source framework for voice and multimodal conversational AIβ10,529Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomβ¦β4,214Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β21,456Updated this week
- β¨ The Next Gen Airtable Alternative: No-Code Postgresβ20,984Updated this week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configration.β55,868Updated this week
- The easiest way to use Agentic RAG in any enterpriseβ4,405Jan 22, 2025Updated last year
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β9,799Updated this week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,049Jan 8, 2025Updated last year