kadirnar / whisper-plus
WhisperPlus: Faster, Smarter, and More Capable π
β1,788Updated 2 weeks ago
Alternatives and similar repositories for whisper-plus:
Users that are interested in whisper-plus are comparing it to the libraries listed below
- Incredibly fast Whisper-large-v3β1,852Updated last year
- β1,113Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β4,120Updated 2 months ago
- β591Updated 10 months ago
- TTS with kokoro and onnx runtimeβ1,614Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,745Updated last month
- Whisper with Medusa headsβ822Updated last week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,580Updated 6 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ584Updated 2 months ago
- HTML to Markdown converter and crawler.β522Updated last year
- π Youtube Videos Transcription with OpenAI's Whisperβ558Updated last year
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinemaβ2,157Updated 2 weeks ago
- StreamSpeech is an βAll in Oneβ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.β1,027Updated 5 months ago
- Inference and training library for high-quality TTS models.β5,025Updated 2 months ago
- Converts text to speech in realtimeβ2,554Updated this week
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,247Updated last week
- AI Video Search Engine (RAG)β555Updated last month
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,465Updated 6 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ1,751Updated 2 weeks ago
- Foundational model for human-like, expressive TTSβ4,035Updated 6 months ago
- A nearly-live implementation of OpenAI's Whisper.β2,448Updated 2 weeks ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisperβ4,157Updated 2 months ago
- first base model for full-duplex conversational audioβ1,707Updated last month
- Effortlessly add AI-generated transcription subtitles to your videosβ539Updated 3 months ago
- Synchronized Translation for Videos. Video dubbingβ1,022Updated 3 weeks ago
- Open source inference code for Rev's modelβ377Updated last month
- A collection of prompts, system prompts and LLM instructionsβ552Updated 5 months ago
- Interface for OuteTTS models.β926Updated last week
- β8,100Updated 8 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β5,607Updated last month