FL33TW00D / whisper-turbo
Cross-Platform, GPU Accelerated Whisper ποΈ
β1,766Updated 10 months ago
Alternatives and similar repositories for whisper-turbo:
Users that are interested in whisper-turbo are comparing it to the libraries listed below
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,570Updated 5 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,699Updated last week
- A voice chat appβ1,082Updated 2 months ago
- An Open Source text-to-speech system built by inverting Whisper.β4,080Updated last month
- β1,106Updated 6 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,499Updated 9 months ago
- ML-powered speech recognition directly in your browserβ2,696Updated 3 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β880Updated last year
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.β615Updated 8 months ago
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorchβ1,457Updated 2 months ago
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β1,867Updated last month
- first base model for full-duplex conversational audioβ1,669Updated last week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ468Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generationβ762Updated 2 months ago
- Whisper with Medusa headsβ818Updated 2 weeks ago
- A nearly-live implementation of OpenAI's Whisper.β2,289Updated this week
- Foundational model for human-like, expressive TTSβ3,979Updated 5 months ago
- A reactive runtime for building durable AI agentsβ1,288Updated last week
- A fast multimodal LLM for real-time voiceβ2,760Updated this week
- Use your own AI models on the webβ906Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ2,316Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ331Updated 7 months ago
- Explore large language models in 512MB of RAMβ1,181Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,265Updated 5 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisperβ3,998Updated 3 weeks ago
- An extensible, easy-to-use, and portable diffusion web UI π¨βπ¨β1,669Updated last year
- Turn expensive prompts into cheap fine-tuned modelsβ2,526Updated 7 months ago
- Let's make sand talkβ588Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMsβ1,452Updated last month