kadirnar / whisper-plus
WhisperPlus: Faster, Smarter, and More Capable
★1,932 · Updated last month
Alternatives and similar repositories for whisper-plus
Users that are interested in whisper-plus are comparing it to the libraries listed below
- Incredibly fast Whisper-large-v3 ★1,880 · Updated last year
- ★1,150 · Updated 11 months ago
- ★610 · Updated last year
- Youtube Videos Transcription with OpenAI's Whisper ★560 · Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ★888 · Updated 7 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ★1,642 · Updated last year
- Effortlessly add AI-generated transcription subtitles to your videos ★547 · Updated last year
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS ★947 · Updated last year
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ★4,027 · Updated last year
- Whisper with Medusa heads ★864 · Updated 5 months ago
- Youtube Videos Transcription with OpenAI's Whisper ★410 · Updated last year
- Whisper command line client compatible with original OpenAI client based on CTranslate2. ★1,191 · Updated last month
- ML-powered speech recognition directly in your browser ★3,223 · Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ★2,141 · Updated 2 months ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ★2,734 · Updated 4 months ago
- An Open Source text-to-speech system built by inverting Whisper. ★4,551 · Updated last month
- Apple PodCast Transcription with OpenAI's Whisper ★347 · Updated 2 years ago
- Local SRT/LLM/TTS Voicechat ★752 · Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,668 · Updated last year
- Cross-Platform, GPU Accelerated Whisper ★1,803 · Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ★385 · Updated last year
- TTS with kokoro and onnx runtime ★2,344 · Updated last month
- Synchronized Translation for Videos. Video dubbing ★1,311 · Updated last month
- face-to-sticker ★650 · Updated last year
- Open source inference code for Rev's model ★435 · Updated 9 months ago
- AI Video Search Engine (RAG) ★616 · Updated 10 months ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis. ★1,232 · Updated 6 months ago
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch ★1,545 · Updated 9 months ago
- Examples for Cerebrium Serverless GPUs ★515 · Updated 3 weeks ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ★348 · Updated last year