FL33TW00D / whisper-turbo
Cross-Platform, GPU-Accelerated Whisper 🎙️
★ 1,735 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for whisper-turbo
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ★ 1,547 · Updated 3 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ★ 3,615 · Updated 3 weeks ago
- A voice chat app ★ 1,070 · Updated this week
- An Open Source text-to-speech system built by inverting Whisper. ★ 3,982 · Updated 5 months ago
- Incredibly fast Whisper-large-v3 ★ 1,845 · Updated 9 months ago
- This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR mode… ★ 871 · Updated 11 months ago
- A fast multimodal LLM for real-time voice ★ 1,339 · Updated this week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ★ 322 · Updated 5 months ago
- ★ 1,094 · Updated 5 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★ 4,444 · Updated 7 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ★ 4,962 · Updated 3 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2. ★ 916 · Updated last week
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ★ 1,411 · Updated 2 months ago
- Turnkey self-hosted offline transcription and diarization service with LLM summary ★ 738 · Updated last month
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ★ 730 · Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation ★ 2,092 · Updated this week
- A nearly-live implementation of OpenAI's Whisper. ★ 2,060 · Updated 2 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ★ 718 · Updated 3 months ago
- First base model for full-duplex conversational audio ★ 1,560 · Updated last week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ★ 444 · Updated last year
- Open Source framework for voice and multimodal conversational AI ★ 3,385 · Updated this week
- ML-powered speech recognition directly in your browser ★ 2,581 · Updated last month
- An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨 ★ 1,668 · Updated last year
- Local AI API Platform ★ 2,111 · Updated this week
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper) ★ 1,841 · Updated this week
- A reactive runtime for building durable AI agents ★ 1,278 · Updated last week
- ★ 7,741 · Updated 5 months ago
- An extremely fast implementation of Whisper optimized for Apple Silicon using MLX. ★ 584 · Updated 6 months ago
- Build browser agents for real world tasks ★ 990 · Updated 11 months ago