kadirnar / whisper-plusLinks
WhisperPlus: Faster, Smarter, and More Capable π
β1,925Updated 2 weeks ago
Alternatives and similar repositories for whisper-plus
Users that are interested in whisper-plus are comparing it to the libraries listed below
Sorting:
- Incredibly fast Whisper-large-v3β1,878Updated last year
- β1,140Updated 10 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,643Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,539Updated 6 months ago
- π Youtube Videos Transcription with OpenAI's Whisperβ560Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ878Updated 6 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ940Updated last year
- β609Updated last year
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,998Updated 11 months ago
- ML-powered speech recognition directly in your browserβ3,184Updated last year
- Whisper with Medusa headsβ864Updated 4 months ago
- AI Video Search Engine (RAG)β613Updated 9 months ago
- Local SRT/LLM/TTS Voicechatβ744Updated last year
- Apple PodCast Transcription with OpenAI's Whisperβ347Updated 2 years ago
- Effortlessly add AI-generated transcription subtitles to your videosβ546Updated last year
- TTS with kokoro and onnx runtimeβ2,290Updated 5 months ago
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,805Updated last year
- Inference and training library for high-quality TTS models.β5,490Updated last year
- β8,746Updated last month
- HTML to Markdown converter and crawler.β603Updated last year
- StreamSpeech is an βAll in Oneβ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.β1,205Updated 5 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,160Updated 3 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,038Updated 11 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,647Updated last year
- A Fast TTS Engineβ599Updated 10 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ383Updated last year
- Effort to open-source NLLB checkpoints.β467Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,698Updated 3 months ago
- first base model for full-duplex conversational audioβ1,769Updated 11 months ago
- face-to-stickerβ649Updated last year