Cross-Platform, GPU Accelerated Whisper ποΈ
β1,801Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-turbo
Users that are interested in whisper-turbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,068Jan 8, 2025Updated last year
- A cross-platform browser ML framework.β756Apr 2, 2026Updated 2 weeks ago
- β12,443Oct 25, 2025Updated 5 months ago
- Converts text input or URL into knowledge graph and displaysβ3,547Dec 24, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β21,363Apr 4, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,845Apr 12, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned modelsβ2,791May 25, 2024Updated last year
- High-performance In-browser LLM Inference Engineβ17,790Updated this week
- An Open Source text-to-speech system built by inverting Whisper.β4,590Dec 14, 2025Updated 4 months ago
- ML-powered speech recognition directly in your browserβ3,299Oct 1, 2024Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β913Apr 11, 2026Updated last week
- Port of OpenAI's Whisper model in C/C++β48,661Mar 29, 2026Updated 3 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,240Aug 10, 2024Updated last year
- Faster Whisper transcription with CTranslate2β22,222Nov 19, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,688Apr 3, 2024Updated 2 years ago
- An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.β479Aug 21, 2025Updated 7 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,775Apr 8, 2026Updated last week
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,497Mar 4, 2026Updated last month
- Universal LLM Deployment Engine with ML Compilationβ22,482Updated this week
- On-device Speech Recognition for Apple Siliconβ5,989Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,838Apr 13, 2026Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,043Aug 16, 2024Updated last year
- β9,656Oct 16, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ105Jun 10, 2023Updated 2 years ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β22,141Apr 12, 2026Updated last week
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app β¦β6,541Apr 11, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,588Apr 10, 2026Updated last week
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β2,064Updated this week
- prompt2model - Generate Deployable Models from Natural Language Instructionsβ2,012Dec 29, 2024Updated last year
- π Text-Prompted Generative Audio Modelβ39,073Aug 19, 2024Updated last year
- A natural language interface for computersβ63,151Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,177Mar 3, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β6,745Jun 26, 2025Updated 9 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,643Jul 31, 2024Updated last year
- Inference Llama 2 in one file of pure π₯β2,119Feb 9, 2026Updated 2 months ago
- tiny vision language modelβ9,575Nov 14, 2025Updated 5 months ago
- The open-source visual AI programming environment and TypeScript libraryβ4,536Mar 20, 2026Updated 3 weeks ago
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,446Apr 29, 2024Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMsβ1,513Jan 26, 2025Updated last year