Cross-Platform, GPU Accelerated Whisper ποΈ
β1,794Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-turbo
Users that are interested in whisper-turbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β4,084Jan 8, 2025Updated last year
- A cross-platform browser ML framework.β765May 26, 2026Updated 3 weeks ago
- β12,966Oct 25, 2025Updated 7 months ago
- Converts text input or URL into knowledge graph and displaysβ3,551Dec 24, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β22,462Jun 3, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- State-of-the-art Machine Learning for the web. Run π€ Transformers directly in your browser, with no need for a server!β16,108Jun 10, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned modelsβ2,813May 25, 2024Updated 2 years ago
- High-performance In-browser LLM Inference Engineβ18,185Jun 9, 2026Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β4,613Dec 14, 2025Updated 6 months ago
- ML-powered speech recognition directly in your browserβ3,328Oct 1, 2024Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β916Apr 28, 2026Updated last month
- Port of OpenAI's Whisper model in C/C++β50,829Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,288Aug 10, 2024Updated last year
- Faster Whisper transcription with CTranslate2β23,584Nov 19, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,686Apr 3, 2024Updated 2 years ago
- An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.β481Aug 21, 2025Updated 9 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,793Apr 8, 2026Updated 2 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,550Mar 4, 2026Updated 3 months ago
- Universal LLM Deployment Engine with ML Compilationβ22,792May 11, 2026Updated last month
- On-device Speech AI for Apple Siliconβ6,213Jun 10, 2026Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,903Apr 13, 2026Updated 2 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,567Aug 16, 2024Updated last year
- β9,662Oct 16, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ104Jun 10, 2023Updated 3 years ago
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.β23,318May 14, 2026Updated last month
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app β¦β6,628Apr 11, 2026Updated 2 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ20,840Updated this week
- prompt2model - Generate Deployable Models from Natural Language Instructionsβ2,014Dec 29, 2024Updated last year
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)β2,072May 20, 2026Updated 3 weeks ago
- π Text-Prompted Generative Audio Modelβ39,161Aug 19, 2024Updated last year
- A lightweight coding agent for open models like Deepseek, Kimi, and Qwenβ64,038Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,645Jul 31, 2024Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,377Mar 3, 2026Updated 3 months ago
- Inference Llama 2 in one file of pure π₯β2,124Feb 9, 2026Updated 4 months ago
- β6,744Jun 26, 2025Updated 11 months ago
- tiny vision language modelβ9,770Apr 20, 2026Updated last month
- The open-source visual AI programming environment and TypeScript libraryβ4,608Updated this week
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,454Apr 29, 2024Updated 2 years ago
- Distribute and run LLMs with a single file.β24,950Jun 9, 2026Updated last week