Cross-Platform, GPU Accelerated Whisper ποΈ
β1,806Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-turbo
Users that are interested in whisper-turbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,057Jan 8, 2025Updated last year
- A cross-platform browser ML framework.β750Nov 23, 2024Updated last year
- β11,965Oct 25, 2025Updated 5 months ago
- Converts text input or URL into knowledge graph and displaysβ3,544Dec 24, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,952Updated this week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,608Mar 18, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned modelsβ2,790May 25, 2024Updated last year
- High-performance In-browser LLM Inference Engineβ17,659Updated this week
- An Open Source text-to-speech system built by inverting Whisper.β4,583Dec 14, 2025Updated 3 months ago
- ML-powered speech recognition directly in your browserβ3,283Oct 1, 2024Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β913Jan 2, 2026Updated 2 months ago
- Port of OpenAI's Whisper model in C/C++β47,963Mar 21, 2026Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,227Aug 10, 2024Updated last year
- Faster Whisper transcription with CTranslate2β21,765Nov 19, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,688Apr 3, 2024Updated last year
- An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.β479Aug 21, 2025Updated 7 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,777Mar 3, 2026Updated 3 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,476Mar 4, 2026Updated 3 weeks ago
- Universal LLM Deployment Engine with ML Compilationβ22,282Updated this week
- On-device Speech Recognition for Apple Siliconβ5,842Mar 19, 2026Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,786Mar 23, 2026Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,896Aug 16, 2024Updated last year
- β9,669Oct 16, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ105Jun 10, 2023Updated 2 years ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β21,783Updated this week
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app β¦β6,501Mar 23, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,506Mar 1, 2026Updated 3 weeks ago
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β2,053Updated this week
- prompt2model - Generate Deployable Models from Natural Language Instructionsβ2,014Dec 29, 2024Updated last year
- π Text-Prompted Generative Audio Modelβ39,066Aug 19, 2024Updated last year
- A natural language interface for computersβ62,853Feb 9, 2026Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,112Mar 3, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β6,748Jun 26, 2025Updated 9 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,645Jul 31, 2024Updated last year
- Inference Llama 2 in one file of pure π₯β2,121Feb 9, 2026Updated last month
- tiny vision language modelβ9,455Nov 14, 2025Updated 4 months ago
- The open-source visual AI programming environment and TypeScript libraryβ4,515Mar 20, 2026Updated last week
- AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.β4,339Jul 29, 2024Updated last year
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,450Apr 29, 2024Updated last year