xenova / whisper-web
ML-powered speech recognition directly in your browser
☆3,241 · Oct 1, 2024 · Updated last year
Alternatives and similar repositories for whisper-web
Users who are interested in whisper-web are comparing it to the libraries listed below.
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! ☆15,352 · Updated this week
- High-performance In-browser LLM Inference Engine ☆17,258 · Feb 9, 2026 · Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆20,051 · Feb 8, 2026 · Updated last week
- Faster Whisper transcription with CTranslate2 ☆20,951 · Nov 19, 2025 · Updated 2 months ago
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al… ☆16,810 · Updated this week
- ☆8,809 · Oct 25, 2025 · Updated 3 months ago
- Port of OpenAI's Whisper model in C/C++ ☆46,720 · Feb 9, 2026 · Updated last week
- A fast multimodal LLM for real-time voice ☆4,350 · Dec 12, 2025 · Updated 2 months ago
- SOTA Open Source TTS ☆24,863 · Feb 2, 2026 · Updated last week
- Cross-Platform, GPU Accelerated Whisper ☆1,804 · Feb 27, 2024 · Updated last year
- An open-source RAG-based tool for chatting with your documents. ☆25,019 · Jul 4, 2025 · Updated 7 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆40,419 · Updated this week
- On-device Speech Recognition for Apple Silicon ☆5,597 · Jan 28, 2026 · Updated 2 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆9,794 · May 8, 2025 · Updated 9 months ago
- An AI-powered search engine with a generative UI ☆8,558 · Updated this week
- Open source Claude Artifacts - built with Llama 3.1 405B ☆6,862 · Feb 8, 2026 · Updated last week
- Perplexity Inspired Answer Engine ☆5,017 · Jun 27, 2025 · Updated 7 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆35,918 · Apr 19, 2025 · Updated 9 months ago
- A framework for building realtime voice AI agents ☆9,324 · Updated this week
- Build AI Agents, Visually ☆49,025 · Updated this week
- Perplexica is an AI-powered answering engine. ☆28,872 · Jan 10, 2026 · Updated last month
- Open-source Next.js template for building apps that are fully generated by AI. By E2B. ☆6,160 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,263 · Feb 4, 2026 · Updated last week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ☆11,412 · Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,897 · Sep 30, 2025 · Updated 4 months ago
- Build multi-agent systems that learn and improve with every interaction. ☆37,691 · Feb 9, 2026 · Updated last week
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ☆80,940 · Updated this week
- The Frontend for Agents. Connect any agent framework to Chat, Generative UI, Frontend Tools, Human-in-the-Loop and Shared State. React & … ☆28,762 · Updated this week
- Inference and training library for high-quality TTS models. ☆5,528 · Dec 10, 2024 · Updated last year
- Text-Prompted Generative Audio Model ☆38,970 · Aug 19, 2024 · Updated last year
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆59,947 · Updated this week
- Open Source framework for voice and multimodal conversational AI ☆10,263 · Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision. ☆4,188 · Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time. ☆21,024 · Jan 29, 2026 · Updated 2 weeks ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ☆54,397 · Updated this week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ☆4,039 · Jan 8, 2025 · Updated last year
- The Next Gen Airtable Alternative: No-Code Postgres ☆20,869 · Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing a… ☆35,968 · Updated this week
- The easiest way to use Agentic RAG in any enterprise ☆4,398 · Jan 22, 2025 · Updated last year