xenova / whisper-web
ML-powered speech recognition directly in your browser
☆1,473Updated 3 months ago
Related projects: ⓘ
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,509Updated last month
- WhisperPlus: Faster, Smarter, and More Capable 🚀☆1,679Updated last month
- Cross-Platform, GPU Accelerated Whisper 🏎️☆1,671Updated 6 months ago
- Incredibly fast Whisper-large-v3☆1,830Updated 7 months ago
- Open Source framework for voice and multimodal conversational AI☆3,044Updated this week
- Whisper with Medusa heads☆774Updated last week
- An Open Source text-to-speech system built by inverting Whisper.☆3,772Updated 3 months ago
- Build real-time multimodal AI applications 🤖🎙️📹☆1,053Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,398Updated last month
- Inference and training library for high-quality TTS models.☆4,193Updated last month
- ☆1,079Updated 2 months ago
- 🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser☆429Updated last month
- Local SRT/LLM/TTS Voicechat☆471Updated last month
- A nearly-live implementation of OpenAI's Whisper.☆1,798Updated 2 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,495Updated 2 months ago
- Converts text to speech in realtime☆1,730Updated 3 weeks ago
- Example UI implementing the RTVI web client☆468Updated last month
- MARS5 speech model (TTS) from CAMB.AI☆2,440Updated last month
- ☆740Updated 5 months ago
- Foundational model for human-like, expressive TTS☆3,721Updated last month
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,050Updated last week
- Voice activity detector (VAD) for the browser with a simple API☆773Updated last month
- An AI-powered search engine with a generative UI☆5,925Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,542Updated 2 weeks ago
- AIlice is a fully autonomous, general-purpose AI agent.☆790Updated this week
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆650Updated 2 months ago
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,581Updated 3 weeks ago
- ☆384Updated this week
- ☆567Updated 5 months ago