xenova/whisper-web

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xenova/whisper-web)

xenova / whisper-web

ML-powered speech recognition directly in your browser

☆3,338

Alternatives and similar repositories for whisper-web

Users that are interested in whisper-web are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / transformers.js
View on GitHub
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
☆16,213Updated this week
mlc-ai / web-llm
View on GitHub
High-performance In-browser LLM Inference Engine
☆18,468Jun 9, 2026Updated last month
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,304Jul 13, 2026Updated 2 weeks ago
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,595Nov 19, 2025Updated 8 months ago
FL33TW00D / whisper-turbo
View on GitHub
Cross-Platform, GPU Accelerated Whisper 🏎️
☆1,791Feb 27, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ggml-org / whisper.cpp
View on GitHub
Port of OpenAI's Whisper model in C/C++
☆52,364Jul 11, 2026Updated 2 weeks ago
screenpipe / screenpipe
View on GitHub
YC (S26) | Record your screen 24/7 and plug into your agents. Local, private, secure. Connect to OpenClaw, Hermes agent and 100+ apps
☆20,590Updated this week
fixie-ai / ultravox
View on GitHub
A fast multimodal LLM for real-time voice
☆4,499Dec 12, 2025Updated 7 months ago
Vaibhavs10 / insanely-fast-whisper
View on GitHub
☆12,997Oct 25, 2025Updated 9 months ago
Cinnamon / kotaemon
View on GitHub
An open-source RAG-based tool for chatting with your documents.
☆25,663Jul 14, 2026Updated 2 weeks ago
janhq / jan
View on GitHub
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
☆43,741Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,398Updated this week
argmaxinc / argmax-oss-swift
View on GitHub
On-device Speech AI for Apple Silicon
☆6,290Jul 13, 2026Updated 2 weeks ago
miurla / morphic
View on GitHub
An AI-powered search engine with a generative UI
☆9,007Jul 22, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,167Updated this week
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,038Apr 19, 2025Updated last year
FlowiseAI / Flowise
View on GitHub
Build AI Agents, Visually
☆54,998Updated this week
jina-ai / reader
View on GitHub
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
☆11,747May 22, 2026Updated 2 months ago
Nutlope / llamacoder
View on GitHub
Open source Claude Artifacts – built with Llama 3.1 405B
☆7,035Jul 18, 2026Updated last week
developersdigest / llm-answer-engine
View on GitHub
Perplexity Inspired Answer Engine
☆5,035Apr 29, 2026Updated 2 months ago
huggingface / transformers.js-examples
View on GitHub
A collection of 🤗 Transformers.js demos and example applications
☆2,072Feb 17, 2026Updated 5 months ago
e2b-dev / fragments
View on GitHub
Open-source Next.js template for building apps that are fully generated by AI. By E2B.
☆6,357Updated this week
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,214Aug 19, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
huggingface / distil-whisper
View on GitHub
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,099Jan 8, 2025Updated last year
stanford-oval / storm
View on GitHub
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
☆30,368Sep 30, 2025Updated 9 months ago
zaidmukaddam / scira
View on GitHub
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …
☆11,815Mar 20, 2026Updated 4 months ago
agno-agi / agno
View on GitHub
Build, run, and manage agent platforms.
☆41,472Updated this week
ItzCrazyKns / Vane
View on GitHub
Vane is an AI-powered answering engine.
☆35,896Apr 11, 2026Updated 3 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,580Dec 10, 2024Updated last year
teableio / teable
View on GitHub
✨ AI Spreadsheet for Business
☆21,561Updated this week
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,747May 16, 2026Updated 2 months ago
langchain-ai / open-canvas
View on GitHub
📃 A better UX for chat, writing content, and coding with LLMs.
☆5,495Feb 25, 2026Updated 5 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
openinterpreter / openinterpreter
View on GitHub
A coding agent for open models like Kimi K3
☆67,376Updated this week
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,827Aug 16, 2024Updated last year
browserbase / stagehand
View on GitHub
The SDK For Browser Agents
☆23,659Updated this week
kadirnar / whisper-plus
View on GitHub
WhisperPlus: Faster, Smarter, and More Capable 🚀
☆1,955May 4, 2026Updated 2 months ago
Mintplex-Labs / anything-llm
View on GitHub
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
☆64,015Updated this week
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,839Apr 15, 2026Updated 3 months ago
rashadphz / farfalle
View on GitHub
🔍 AI search engine - self-host with local or cloud LLMs
☆3,535Sep 27, 2024Updated last year