Cross-Platform, GPU Accelerated Whisper ποΈ
β1,796Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-turbo
Users that are interested in whisper-turbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,080Jan 8, 2025Updated last year
- A cross-platform browser ML framework.β758Updated this week
- β12,900Oct 25, 2025Updated 7 months ago
- Converts text input or URL into knowledge graph and displaysβ3,550Dec 24, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β22,043Apr 4, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β16,030May 18, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned modelsβ2,807May 25, 2024Updated 2 years ago
- High-performance In-browser LLM Inference Engineβ18,047May 19, 2026Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β4,608Dec 14, 2025Updated 5 months ago
- ML-powered speech recognition directly in your browserβ3,324Oct 1, 2024Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β916Apr 28, 2026Updated last month
- Port of OpenAI's Whisper model in C/C++β50,237Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,263Aug 10, 2024Updated last year
- Faster Whisper transcription with CTranslate2β23,039Nov 19, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,683Apr 3, 2024Updated 2 years ago
- An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.β481Aug 21, 2025Updated 9 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,779Apr 8, 2026Updated last month
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,531Mar 4, 2026Updated 2 months ago
- Universal LLM Deployment Engine with ML Compilationβ22,687May 11, 2026Updated 2 weeks ago
- On-device Speech AI for Apple Siliconβ6,142May 12, 2026Updated 2 weeks ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,870Apr 13, 2026Updated last month
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,394Aug 16, 2024Updated last year
- β9,664Oct 16, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ105Jun 10, 2023Updated 2 years ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β22,903May 14, 2026Updated 2 weeks ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app β¦β6,613Apr 11, 2026Updated last month
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,787Updated this week
- prompt2model - Generate Deployable Models from Natural Language Instructionsβ2,015Dec 29, 2024Updated last year
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β2,069May 20, 2026Updated last week
- π Text-Prompted Generative Audio Modelβ39,146Aug 19, 2024Updated last year
- A natural language interface for computersβ63,736May 17, 2026Updated last week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,646Jul 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,311Mar 3, 2026Updated 2 months ago
- Inference Llama 2 in one file of pure π₯β2,124Feb 9, 2026Updated 3 months ago
- β6,742Jun 26, 2025Updated 11 months ago
- tiny vision language modelβ9,707Apr 20, 2026Updated last month
- The open-source visual AI programming environment and TypeScript libraryβ4,597May 13, 2026Updated 2 weeks ago
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,455Apr 29, 2024Updated 2 years ago
- Distribute and run LLMs with a single file.β24,500May 22, 2026Updated last week