kadirnar / whisper-plus
WhisperPlus: Faster, Smarter, and More Capable π
β1,722Updated 2 weeks ago
Related projects β
Alternatives and complementary repositories for whisper-plus
- Incredibly fast Whisper-large-v3β1,846Updated 9 months ago
- β1,095Updated 5 months ago
- Whisper with Medusa headsβ800Updated 3 weeks ago
- first base model for full-duplex conversational audioβ1,586Updated last week
- β746Updated 7 months ago
- StreamSpeech is an βAll in Oneβ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.β960Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ326Updated 5 months ago
- Local SRT/LLM/TTS Voicechatβ548Updated last month
- Inference and training library for high-quality TTS models.β4,663Updated 3 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ337Updated 2 months ago
- Converts text to speech in realtimeβ2,041Updated this week
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ736Updated last month
- ML-powered speech recognition directly in your browserβ2,600Updated last month
- β575Updated 7 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β4,840Updated 3 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,548Updated 3 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,621Updated 3 weeks ago
- π Awesome list for Whisper β an open-source AI-powered speech recognition system developed by OpenAIβ1,275Updated 6 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAIβs Whisper model.β316Updated last month
- Open source inference code for Rev's modelβ335Updated last week
- εΊδΊBert-VITS2εη葨ζ γε¨η»ζ΅θ―. Animation testing based on Bert-VITS2.β517Updated 2 months ago
- A collection of prompts, system prompts and LLM instructionsβ483Updated 2 months ago
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinemaβ2,078Updated this week
- β7,759Updated 5 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".β801Updated 3 weeks ago
- π Youtube Videos Transcription with OpenAI's Whisperβ555Updated last year
- Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Coβ¦β1,311Updated this week
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech β¦β934Updated this week
- Controllable and fast Text-to-Speech for over 7000 languages!β1,465Updated 2 weeks ago
- Apple PodCast Transcription with OpenAI's Whisperβ341Updated 11 months ago