xenova / whisper-webLinks
ML-powered speech recognition directly in your browser
β2,972Updated 8 months ago
Alternatives and similar repositories for whisper-web
Users that are interested in whisper-web are comparing it to the libraries listed below
Sorting:
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,801Updated last year
- β1,982Updated this week
- An Open Source text-to-speech system built by inverting Whisper.β4,286Updated 2 weeks ago
- Inference and training library for high-quality TTS models.β5,303Updated 6 months ago
- A fast multimodal LLM for real-time voiceβ4,030Updated 4 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,757Updated last month
- Local realtime voice AIβ2,328Updated 3 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,887Updated 5 months ago
- Record voice notes & transcribe, summarize, and get tasksβ1,950Updated 3 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,613Updated 10 months ago
- β1,134Updated 4 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ741Updated 2 weeks ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), withβ¦β3,717Updated 3 weeks ago
- A collection of π€ Transformers.js demos and example applicationsβ1,603Updated 2 weeks ago
- A nearly-live implementation of OpenAI's Whisper.β3,004Updated 3 weeks ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,052Updated last month
- first base model for full-duplex conversational audioβ1,749Updated 5 months ago
- WhisperPlus: Faster, Smarter, and More Capable πβ1,849Updated this week
- Local AI API Platformβ2,753Updated last week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,603Updated last year
- β8,472Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationβ2,997Updated 5 months ago
- Faster Whisper transcription with CTranslate2β16,692Updated 3 weeks ago
- Convert any PDF into a podcast episode!β2,347Updated 6 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,080Updated last week
- β1,274Updated 2 months ago
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β13,851Updated this week
- Whisper with Medusa headsβ842Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.β3,058Updated this week
- On-device Speech Recognition for Apple Siliconβ4,718Updated last week