Cross-Platform, GPU Accelerated Whisper ποΈ
β1,801Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-turbo
Users that are interested in whisper-turbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,078Jan 8, 2025Updated last year
- A cross-platform browser ML framework.β757Apr 2, 2026Updated last month
- β12,871Oct 25, 2025Updated 6 months ago
- Converts text input or URL into knowledge graph and displaysβ3,548Dec 24, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β21,760Apr 4, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β15,950May 1, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned modelsβ2,795May 25, 2024Updated last year
- High-performance In-browser LLM Inference Engineβ17,937Updated this week
- An Open Source text-to-speech system built by inverting Whisper.β4,602Dec 14, 2025Updated 4 months ago
- ML-powered speech recognition directly in your browserβ3,317Oct 1, 2024Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β913Apr 28, 2026Updated last week
- Port of OpenAI's Whisper model in C/C++β49,414May 2, 2026Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,247Aug 10, 2024Updated last year
- Faster Whisper transcription with CTranslate2β22,691Nov 19, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,689Apr 3, 2024Updated 2 years ago
- An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.β481Aug 21, 2025Updated 8 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,776Apr 8, 2026Updated last month
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,514Mar 4, 2026Updated 2 months ago
- Universal LLM Deployment Engine with ML Compilationβ22,598Apr 22, 2026Updated 2 weeks ago
- On-device Speech AI for Apple Siliconβ6,053May 1, 2026Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,861Apr 13, 2026Updated 3 weeks ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,208Aug 16, 2024Updated last year
- β9,657Oct 16, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ105Jun 10, 2023Updated 2 years ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β22,557Apr 12, 2026Updated 3 weeks ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app β¦β6,581Apr 11, 2026Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,707Apr 24, 2026Updated 2 weeks ago
- prompt2model - Generate Deployable Models from Natural Language Instructionsβ2,014Dec 29, 2024Updated last year
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β2,069Apr 27, 2026Updated last week
- π Text-Prompted Generative Audio Modelβ39,105Aug 19, 2024Updated last year
- A natural language interface for computersβ63,389Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,644Jul 31, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,242Mar 3, 2026Updated 2 months ago
- Inference Llama 2 in one file of pure π₯β2,121Feb 9, 2026Updated 3 months ago
- β6,749Jun 26, 2025Updated 10 months ago
- tiny vision language modelβ9,651Apr 20, 2026Updated 2 weeks ago
- The open-source visual AI programming environment and TypeScript libraryβ4,563May 1, 2026Updated last week
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,446Apr 29, 2024Updated 2 years ago
- Distribute and run LLMs with a single file.β24,349May 1, 2026Updated last week