kadirnar / whisper-plusLinks
WhisperPlus: Faster, Smarter, and More Capable π
β1,859Updated last week
Alternatives and similar repositories for whisper-plus
Users that are interested in whisper-plus are comparing it to the libraries listed below
Sorting:
- Incredibly fast Whisper-large-v3β1,878Updated last year
- β1,133Updated 5 months ago
- β604Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,618Updated 11 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ773Updated last month
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,913Updated 6 months ago
- An Open Source text-to-speech system built by inverting Whisper.β4,312Updated last month
- π Youtube Videos Transcription with OpenAI's Whisperβ564Updated last year
- Whisper with Medusa headsβ849Updated last week
- StreamSpeech is an βAll in Oneβ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.β1,116Updated 2 weeks ago
- ML-powered speech recognition directly in your browserβ2,989Updated 9 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ890Updated 9 months ago
- Inference and training library for high-quality TTS models.β5,349Updated 7 months ago
- AI Video Search Engine (RAG)β588Updated 4 months ago
- Local SRT/LLM/TTS Voicechatβ696Updated 9 months ago
- TTS with kokoro and onnx runtimeβ2,093Updated 3 weeks ago
- Controllable and fast Text-to-Speech for over 7000 languages!β1,622Updated 2 weeks ago
- Synchronized Translation for Videos. Video dubbingβ1,178Updated 2 weeks ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ363Updated last year
- Effortlessly add AI-generated transcription subtitles to your videosβ545Updated 8 months ago
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,802Updated last year
- Open source inference code for Rev's modelβ412Updated 2 months ago
- Foundational model for human-like, expressive TTSβ4,136Updated 11 months ago
- MemoAI Video to translated text, subtitles and notes made easy.β649Updated 2 months ago
- Apple PodCast Transcription with OpenAI's Whisperβ349Updated last year
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,074Updated last month
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β6,265Updated 6 months ago
- Examples for Cerebrium Serverless GPUsβ499Updated 3 weeks ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.β742Updated last year
- εΊδΊBert-VITS2εη葨ζ γε¨η»ζ΅θ―. Animation testing based on Bert-VITS2.β531Updated 4 months ago