xenova / whisper-web
ML-powered speech recognition directly in your browser
β2,854Updated 5 months ago
Alternatives and similar repositories for whisper-web:
Users that are interested in whisper-web are comparing it to the libraries listed below
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitchingβ2,112Updated this week
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β13,242Updated this week
- Open Source framework for voice and multimodal conversational AIβ5,298Updated this week
- Local realtime voice AIβ2,260Updated 3 weeks ago
- WhisperPlus: Faster, Smarter, and More Capable πβ1,803Updated 3 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,640Updated 3 weeks ago
- Whisper with Medusa headsβ821Updated 3 weeks ago
- An Open Source text-to-speech system built by inverting Whisper.β4,166Updated 3 months ago
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,788Updated last year
- A fast multimodal LLM for real-time voiceβ3,757Updated last month
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β5,801Updated 3 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translationβwhere one waits fβ¦β926Updated last month
- TTS with kokoro and onnx runtimeβ1,789Updated 3 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,560Updated 7 months ago
- A collection of π€ Transformers.js demos and example applicationsβ1,338Updated 3 weeks ago
- Inference and training library for high-quality TTS models.β5,161Updated 3 months ago
- β1,582Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translationβ2,633Updated 2 months ago
- https://hf.co/hexgrad/Kokoro-82Mβ1,825Updated last week
- On-device Speech Recognition for Apple Siliconβ4,417Updated last month
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β14,656Updated this week
- Converts text to speech in realtimeβ2,727Updated this week
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), withβ¦β3,493Updated this week
- β8,233Updated 9 months ago
- Open source Claude Artifacts β built with Llama 3.1 405Bβ5,715Updated 2 months ago
- Faster Whisper transcription with CTranslate2β14,975Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,588Updated 7 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ3,913Updated 2 weeks ago
- β1,199Updated 6 months ago
- Convert any PDF into a podcast episode!β2,152Updated 3 months ago