xenova / whisper-webLinks
ML-powered speech recognition directly in your browser
β3,211Updated last year
Alternatives and similar repositories for whisper-web
Users that are interested in whisper-web are comparing it to the libraries listed below
Sorting:
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ3,072Updated last month
- WhisperPlus: Faster, Smarter, and More Capable πβ1,930Updated last month
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,018Updated last year
- Voice activity detector (VAD) for the browser with a simple APIβ1,776Updated 2 weeks ago
- Inference and training library for high-quality TTS models.β5,504Updated last year
- A collection of π€ Transformers.js demos and example applicationsβ1,915Updated last month
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,804Updated last year
- Local realtime voice AIβ2,422Updated last month
- TTS with kokoro and onnx runtimeβ2,335Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechatβ749Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,643Updated last year
- β2,808Updated this week
- A fast multimodal LLM for real-time voiceβ4,317Updated last month
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ944Updated last year
- MARS5 speech model (TTS) from CAMB.AIβ2,810Updated last year
- Converts text to speech in realtimeβ3,718Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β4,549Updated last month
- A nearly-live implementation of OpenAI's Whisper.β3,734Updated this week
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,209Updated this week
- Whisper with Medusa headsβ863Updated 5 months ago
- first base model for full-duplex conversational audioβ1,770Updated last year
- Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.β2,676Updated last year
- β1,353Updated 9 months ago
- Incredibly fast Whisper-large-v3β1,881Updated last year
- Interface for OuteTTS models.β1,419Updated 6 months ago
- Yes, it's another chat over documents implementation... but this one is entirely local!β1,812Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,268Updated 9 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), withβ¦β5,823Updated last month
- Foundational model for human-like, expressive TTSβ4,192Updated last year
- Convert any PDF into a podcast episode!β2,551Updated last year